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Preface 


This text is intended for the beginning graduate student with minimal prep¬ 
aration. However since Lie groups abstract the analytic properties of matrix 
groups, the student is expected to have some knowledge of senior level algebra, 
topology, and analysis as given in some of the references. In Chapter 1 we review 
some advanced calculus and extend these results to manifolds in Chapter 2. 
Consequently the reader knowing these results can skip these chapters but should 
pay attention to the examples on matrix groups. After this the reader probably 
should follow the order given in the contents noting that the first part of the 
text is about Lie groups while the algebraic study of Lie algebras begins in 
Chapter 9. We have not attempted to prove all basic results so the serious student 
should take the indicated detours to such texts as those by Freudenthal and 
de Vries, Helgason, Jacobson, or Wolf. In particular the student must develop 
his own taste in this subject and ours is only one point of view. 

We are deeply indebted to the many authors and teachers in the subject and 
to them we express our gratitude. We would like to thank Dr. J. R. Schumi for 
his assistance, Professors Harry Allen, Charles Conatser, and Earl Taft for their 
timely suggestions, and the National Science Foundation for their support of 
mathematics. Also thanks are due to Barb Ketter and Els Sagle for their excellent 
manuscript preparation and to the people at T.S.V. and at the University of 
Minnesota for their encouragement. 



CHAPTER 1 

SOME CALCULUS 


We shall present many familiar concepts of differential calculus in the 
terminology of linear algebra. Thus, for functions from one Euclidean space 
to another, derivatives are given as linear transformations, higher order 
derivatives are given as multilinear forms, and Taylor’s series is presented in 
this terminology. Instead of giving detailed proofs we present many examples 
involving matrix groups which will be abstracted in later chapters. 


1. Basic Notation 


We now informally review some basic concepts with which the reader 
should be familiar. Thus let V denote an w-dimensional vector space over a 
field K and let X u ..., X n be a basis of V. Then any point or vector p (or P) 
in V can be uniquely represented by 

P = tPi X i 

i= 1 

and we call the p t e K the coordinates of p relative to the basis X<, ..., X n 
of V. In particular when we let e k = (0,..., 1,0.. .0) with 1 in the kth position 
and use the basis e x , ..., e n , then we frequently write p = (p u as a 

vector in V. For a fixed basis X u ..., X n of V, the functions 

Ui : V-*K:p 


1 
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for / = 1, ..., n are called the coordinate functions for F relative to the basis 
X l9 ..., X n . Thus we obtain a “ coordinate system” by a choice of basis in F 
and in particular obtain the usual coordinate system by choosing the e t ,..., e„ 
basis of F. 

Let IT be an m-dimensional vector space over K and with F as above let 

Hom K (F, W) 

or just Hom(F, W) denote the set of linear transformations of F into W. 
Thus 

T \ V -+W \ X -> T(X) 

is in Hom(F, W) if T(aX + bY) = aT(X ) + bT(Y) for all a, b e K and 
X, Y e V. In particular Hom(F, W) is a vector space over K of dimension 
m • n relative to the usual definitions: For 5, T eHom(F, W) and a, b e K 
define (aS + bT)(X) = aS(X) + bT(X) for all X e V. We shall also use 
the notation 


L(F, W) for Hom(F, W) 

and 

End(F) for Hom(F, F). 

Now let K = R, the real numbers, and let V = R n which we regard as the 
set of all w-tuples X = (x u ..., x„) with x ( e R and with the operations 

aX + bY =(ax l +by u ... 9 ax n + by n ) 

f° r Y = (Jij • • • 5 y n ) ^nd a, b e R. With this representation of F we have a 
natural inner product 

B : V x F ->R 

given by the formula 

B(X , Y)=t x iyi . 

i= 1 

Thus for Z, 7, Z e F and a, b e R, B satisfies 

(1) B(aX + bY,Z) = aB(X,Z) + bB(Y,Z)\ 

(2) 5(Z, 7) = 5(7,^); 

(3) B(X , Z) > 0 and ^(Z, Z) == 0 if and only if Z = 0. 

Any function B : V x V R satisfying (1) and (2) above is called a symmetric 
bilinear form and if it also satisfies (3), B is called a positive definite symmetric 
bilinear form. 
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A norm on a vector space V is a function n : V -+R satisfying 

(1) n(X) > 0 and n(X) = 0 if and only if X = 0; 

(2) if a e R and X e V, then n(aX) = \a\ n(X ); 

(3) n(X + Y) < n{X) + n(Y) for all X, Y eV. 

We shall also use the notation 

n(X)= \X\ = \\X\\. 

In particular, if B is a positive definite symmetric bilinear form on the vector 
space V over R , then 

||X|| = B(X, XY' 2 

is a norm on V and we have the inequality 

i b ( x , y )i < min. 

Using a norm on V we can define a metric d on V by 
d(X, Y) = II* - Y\\ 
for X, Y e V. Thus d satisfies 

(1) d(X, Y) > 0 and d(X, Y) = 0 if and only if * = Y; 

(2) d(X, Y) = d(Y, X); 

(3) d(X, Y) < d(X, Z) + d(Z, Y). 

In particular, with ||*||=5(*, *) 1/2 =(X*j 2 ) 1/2 we obtain d(X, Y) = 
HiXi-Yi) 2 ] 112 - 

We now consider some of the topological properties of V — R n which 
arise from a metric d obtained from a given fixed norm. Thus we define 
the open ball of radius r with center p by 

B(p, r) = {XeV: d(p 9 X) < r} 

and say subset S of V is open in V if for every p e S there exists r > 0 so that 
the open ball B(p , r) is contained in S. Using this definition we obtain the 
basic results on the metric topology of R n with which we assume the reader is 
familiar. 

Notice that it really does not matter which norm we start with when 
considering the topological properties of V = R n relative to a metric induced 
by a norm. Thus if n 1 and n 2 are norms on V , we can show that there exist 
constants a and b in R so that for all X e V 

an t (X) < n 2 (X ) < bn^X); 

that is, and n 2 are equivalent norms. Thus, if d k is the metric determined by 
the norm n k , then using the above inequality it is easy to see that the open sets 
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of V relative to d l are the same as the open sets relative to d 2 [Dieudonne, 
1960; Lang, 1968]. Most often we shall determine a norm n on Fby choosing 

a basis . X, of Vand for X = X t set n(X) = Q>, 2 ) 1/2 . In particular 

we obtain the usual inner product and norm by taking the basis e u ..., e„ of 
V = R n and call the vector space R" with the topology induced by this norm 
Euclidean n- space. 

With the topology in V = R n induced by a norm n as above we now note 
that V is complete. Thus a sequence of vectors {x k } in V is called a Cauchy 
sequence if given any e > 0 there exists N so that for all p,q>N we have 

n(x p - x q ) < e. 

We have the result that every Cauchy sequence in V has a limit; that is, V 
is complete [Dieudonne, 1960; Lang, 1968]. Let {x k } be a sequence in Land 
kt Zj, ...,Z„bea basis of V. Then we can write 

x k = X kl Xi + h x kn X„ 

and note {x k } converges if and only if each sequence {x ki }, i = 1, n, 
converges. Thus, by a skillful choice of a basis, it might be obvious that the 
sequences {x ki } converge so that {x k } can be shown to converge easily. 

Let {.x*} be a sequence in V = R n . Then the series of vectors ^ x k in V 
converges if the sequence {^ p } given by the partial sums 

p 

Sp = Z Xk 

k= 1 

converges. Now associated with any series in Fis the series of real numbers 

Hn(x k ) 

formed by taking the series of norms of each term. We have the following 
expected results [Dieudonne, 1960; Lang, 1968], 

(1) If the series Xn(x*) converges in R, then the series converges in 
the Euclidean space R ; in this case we say converges absolutely, 

(2) If converges absolutely to the limit a, then the series obtained 
by any rearrangement of the terms also converges absolutely to a. 

^ et V = R n and W = R m be Euclidean spaces as previously discussed 
and let U be a nonempty subset of V. Then for a choice of basis Y, ... Y 
of IF a function 

f:U-*W:p~* f(p) 
has the coordinate representation 

m 

f(p) = YA(p)Y k . 

k=l 
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The coordinate functions for / 

f k :U^R:p-+f k (p) 

can be used to describe the boundedness or continuity of the function 
/ ; U -► W as follows. 

A function f: U-+W is bounded on U if there exists a real number 
C > 0 so that «(/(«)) < C for all wet/, where n is the Euclidean norm in W, 
If we let 

B(U, W) 

denote the set of bounded functions on U into W, then a straightforward 
computation shows B(t7, W) is a vector space of functions over R relative to 
the usual operations 

(af + bg)(p) = af(p) + bg(p). 

Furthermore it is easy to see that / e B(t7, W) if and only if each coordinate 
function f k e B(U, R ). 

Similarly with the usual definition of a continuous function / : U -> W 
we find that the set of all such functions is a vector space of functions over R 
which we denote by 

C(U, W). 

In particular, for a function / : U -> W, we have that / e C(U , W) if and only 
if each coordinate function f k e C(U , R). We shall frequently use the notation 

C(U) = C(U, R) 

and note that C(U) is closed under the pointwise product 

(fg)(p) = f(p)g(pl 

Thus C(U) becomes an associative algebra relative to this product. 

The vector space of bounded functions B(U , W) has a sup norm given by 

ll/ll = sup{||/(jc)|| : X e U}, 

and the uniform convergence of a sequence {/„} of functions 

fn'U^W 

is related to the norm as follows. Recall {/„} converges uniformly on U to a 
function / : U -► W if for every e > 0 there is an integer N such that n> N 
implies for all x e U 


ll/ B (*) “ /Mil < £ - 
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Consequently in terms of norms, {/„} converges uniformly to / on U if for 
every e > 0 there is an integer N such that for all n > N,f„ - f e B (U,W) and 

ll/.-/ll<«. 

Using this together with the fact that if U is compact, then C(U) e B( U, R) 
we obtain the following summary of results [Dieudonne, 1960; Lang, 1968]: 

Theorem Let V, W be Euclidean spaces and let U be a nonempty subset 
of K 

(a) The space B(£/, W) with sup norm is complete. 

(b) If {/„} is a sequence of functions in C(U, W) which converges uni¬ 
formly on U to a function f; U ->W, then / e C(U, W). 

(c) The space BC(U, W) of bounded continuous functions from U 
into W is complete and closed in B(£/, W) (in the sup norm). In particular, 
if U is compact, this applies to C(U) = BC(U, R). 

(d) (Weierstrass test) Let {/„} be a sequence in B({/, W) so that there 
exist real numbers M n with ||/,|| < M n and £M„ convergent. Then the series 
LL converges absolutely and uniformly. Furthermore if each /„ is continuous 
on U, then £/„ is continuous on U. 

Remarks (1) If we use the field K = C, the complex numbers, then 
results analogous to those discussed in this section also hold. However, 
we must use a Hermitian form instead of the inner product to define the 
metric. Thus if V = C", a Hermitian form is a mapping H : V x V -*■ C 
satisfying for X,Y,ZeV and a e C, 

(i) H(X, Y + Z) = H{X, Y) + H(X, Z) ; 

(ii) H(aX , Y) = aH(X, Y) and H(X, aY) = aH(X, Y) where the over- 
bar denotes the complex conjugate; 

(iii) H{X, Y) = H( Y,X)- 

(iv) H(X, X) > 0 if X # 0 and H(X, X) — 0 if and only if X — 0. 

Note that from (iii) H(X, X) - H(X, X) so that H(X, X) e R. Thus (iv) 
allows us to define a norm by ||X|| = H(X, X) 1/2 and consequently a metric. 
In particular, forX = (*,,..., x„) and Y= (y u y n ) in V = C", H(X, Y) = 
L x kYk defines a Hermitian form and || X |, = x t ) 12 . 

(2) We use the following convention to obtain a matrix for 

T e Hom(F, W). 

Let X u ...,X n be a basis of V and let Y,,..., Y m be a basis of W and 
let T(X i) = L ?-1 a ji Yj ■ Then the matrix for T relative to these bases is 
(aji); that is, the ith column is the set of coefficients obtained from the above 


1. BASIC NOTATION 


7 


expression for T(Xi). In particular, the various canonical forms will have 
upper triangular matrices. 

Exercises (1) Let X u ..., X n be a basis of V over C so that any T e 
End(7) = L(V , V) has matrix (/ y ) relative to this basis. Show 

IIT’ll = sup{|/ 0 |} 

defines a norm on End(F). What happens when one changes basis? 

(2) For T g End(F) show that 

lir|| = sup{||rx||/||x||:x#0} 

defines a norm on End(F) such that 

||r|| > max{|r 1 |,|r„|} 
where the r f are the characteristic roots of T. 

Example (1) We now consider some series in End(7). First we note 
that for T e End(F) the sup norm 

imi = sup{||r(X)||/M : x * 0} = sup{||r(Y)||: IIH = 1} 
satisfies, for S, T e End(7), 

IISTII < l|S||||T||. 

In particular, ||r n || < ||T||". Next in discussing the convergence of a series 
Yfin T n in End(F) we make a choice of basis in V for which the matrix T has a 
desirable form. Then we can define a norm on V induced by this basis which 
can be used to compute the sup norm on End(F) in terms of the matrix for T. 
Thus if T is represented by matrices A and B relative to different bases so that 
B = PAP" 1 , then using the norms defined by these basis we see 
exists if and only if B n exists. This is because 

5 >„*” = YfiniPAP- 1 )* = P(Ya n A n )p-K 

Now note that we can regard the real vector space V as contained in a 
complex vector space W over C so that the norm in V is induced by a norm 
in W given by a Hermitian form. Thus we can use the complex canonical 
forms for the matrix of T to investigate convergence of Yfin T n - We use this 
now to sketch a proof of the following results [Jacobson, 1953, Vol. II]. 

Proposition Let Yflj 2 * be a power series over the complex numbers C 
with radius of convergence p and let T e End(IT) be such that its character¬ 
istic roots r k satisfy |r*| < p. Then the power series ]Ta 7 T j exists in End(JF). 
In particular this result holds for real power series. 
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Proof Choose a basis of W so that the matrix A of T has canonical form 

X 

A 2 0 


where each block has the form 


V 1 


w 


where r is a characteristic root of T . Consequently, since the matrix of the 
powers T k have the same block form, it suffices to consider the series for a 
matrix A of the form (*). Now if 

k k 

Sk(z) = X ajz*, s k '(z ) = £ jdjZ y_1 , etc., 

j = 0 ,, = 0 

then the £th partial sum s k {A) for the series is of the form 

s k (r) s k '(r) 

Sk(r) s k '(r) 

0 s k (r) 


Thus if | rJ < p, then the sequences {s k (r)}, {^'(r)}, etc., converge to s(r), 
s'(r), etc., so that the sequence {s k (Aj} converges; that is, Y. a kT k exists. 

Wc apply this to the complex power series 

e 2 = l +z + ^z 2 + ■■■ 

which converges for all z e C. Thus for any T e End(F) we note that the 
characteristic roots of T are bounded so that 


exp(r) =e T = I+T +^T 2 + ■■■ 
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exists. Thus we have the mapping 

exp : End(L) -► End(F): T ->e T 

called the exponential mapping (for endomorphisms). 

If T has a matrix A of the form (*) above, then exp A is a matrix of the 
form 


exp r exp r 
exp r 

0 

Thus in general exp T is computed from blocks of such matrices. 

Exercises Let V = R n be Euclidean n-space and let T e End(L). 

(3) Using the above canonical form for exp T\ show det(exp T) — e trT , 
where trT— trace T is the sum of the characteristic roots of T. Thus the 
exponential map has domain End(F) and range contained in GL(V) the 
group of nonsingular endomorphisms of V. We also use the notation 
GL(n , R) for GL(V). 

(4) Show that the series for exp T is absolutely and uniformly convergent 
on any closed ball in End(F). (If possible, try not to use the Weierstrass test 
but estimate directly). Thus show exp : End( V) -> GL(V) is continuous. 

(5) Show GL(V) is an open subset of the Euclidean space End(L). 


exp r 
2! 

exp r 
exp r 


2. The Derivative 


In this section we formulate the basic facts on differentiation in terms of 
linear transformations. We do not give many proofs so the student should 
regard many of the statements as exercises or refer to Dieudonne [1960], 
Lang [1968], or Spivak [1965]. 

Definition 1.1 Let V and W denote the Euclidean spaces R n and R m y 
respectively, and let U be an open subset of K Then the function 

f:U->W 
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is differentiable at p e U if there exists a linear transformation T e L(V, W) 
so that for all X e V the limit 

I f(p + x) - ftp> - Ttxn 

1™-m- 

where || || denotes the usual Euclidean norm in V or W. 

Thus for X sufficiently small in V we have the approximation for a differ¬ 
entiable function 

f(p + X) = f(p) + T(X) + ||X||e(X), (*) 

where e(X) is a function such that 

lim e(X) = 0. 
x-»o 

If such a linear transformation T exists, then T is called the derivative of / 
at p e U and is denoted by Df(p), f'(p), or df(p). Thus 

Df{p)X = T(X) 

for all X e V. This definition uses the uniqueness part of the following result. 


Proposition 1,2 Let / : U -* W be differentiable at p e U and let 
T l ,T 1 e L(V, W) satisfy 

\\f(p + X)-f(p)-T i (X)\\ A 

!™„-m- =0 

for i=l, 2 and all X e V. Then T y = T 2 = Df(p). Furthermore for all 
X e V 


Df(p)(X) = lim - [f(p + tX) - f(p)]. 

t->o t 


Proof For 0 # X e V let d(X) = f(p + X) — f(p). Then we have 

||T,(X)-r 2 (Z)|| \\T\(tX) - T 2 (tX)\\ 

IIXII \\tx\\ 

= lim IZiM) - d(tX > ± d(tX ) - TjWn 

™ \\tx II 

^ lim ||s 1 (?A') + e 2 (rz)|| = 0, 
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where we use the approximation (*). This implies T t (X) = T 2 (X) for all 
X e V. Using (*) we also have 

Df{p){X) = lim \ Df(p)(tX) + lim ) |/X|«(tf) 

= lim | [f(p + tX) - f(p)]. 

f-»0 t 


Examples (1) Let /: K-* W be a linear transformation. Then Df{p) = f 
for all psV ; For let X e V 9 then f(p + X) - f(p) = f(X). Thus for T = f 
we see f(p + X) — f(p) - T(X) = 0 so that the limit in the definition is 0 
and by uniqueness Df(p) = f 

(2) Let V = W = R and let /:£/-► IF be differentiable at p in the usual 
calculus sense with 

f'(p) = lim 7 U{P + 0 - f(p)l 

t~*0 t 

Then the linear transformation T : R^>R: X -> /'(0^ given by multi¬ 
plication satisfies Definition 1.1. Thus 2)/(p) has the 1 x 1 matrix (/'(p)) 
relative to the basis of 7? consisting of the number 1. 

Proposition 1.3 Let V and W be Euclidean spaces with U an open 
subset of K 

(a) If / : U -> W is differentiable at p e U, then / is continuous at p. 

(b) If /, g : U -+ W are functions differentiable at p e U and if a, b e R , 
then af + bg is differentiable at p and 

D(af + 6 #) 0 ) = a/)/(p) + bDg(p). 

(c) Let Fj, F 2 be Euclidean spaces and let B : V x x F 2 W be a bilinear 
map. Suppose/ : U -+V t and g : C/ F 2 are differentiable at p e t7. Then the 
“product” £(/, g)\U ^ W: u^B(f(u), g(u)) is differentiable at p and for 
IgF, 

[/)(*(/, p))0)](X) = B(f'(p)X, g(p)) + 2?(/(p), g’(p)X). 

(d) (Chain rule) Suppose U i is open in V l9 U 2 is open in V 2 , and 
/ : U 1 -+ U 2 and g : U 2 W. Let p be an element of U t such that/is differenti¬ 
able at p and g is differentiable at/(/?). Then the composition g o / : U x -► W 
is differentiable at p and 

D(g ° f)(p) = Dg(f(pj) ° £>/(/>). 

Thus the derivative of the composition is just the composition of the deriv¬ 
atives regarded as linear transformations. 
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Proof of (d) Let 0 ^ X e V t be near 0 in V l9 let S' = Df(p ), and let 
T =(Dg){f(p)). Then 

f(p + X) =f(p ) + y 
where Y = Df(p)X + ||X||e(X) and 

g{f{p + X)) = g(f(p) + 7) = g(f(pj) + [Dg(f(p))]Y + || 7116,(7). 

Let A(X) = (g . f)(p + X) - (g ° f)(p) - T o S(X). Then 

l|A(*)||/imi = \\g(f(p + X)) - g(f(p )) - T 5(^)11/11^11 
= \\[Dg(f(p))]Y +117118,(7) - TS(mm 
= ll[AK/00)](ll*l|e(*)) + 117116,(7)11/11X11. 

Thus for M = \\Dg(f(p))\\ and N = \\Df(p)\\, the norm of a linear transform¬ 
ation as in Section 1.1, we have 

|| 71| = \\Df(p)X + ||X||6(X)|| 

< \\Df(p)X\\ + ||X||||6(Z)|| <{N+ ||6(X)||)||X||. 

Consequently 

||A(X)||/||X|| <M||6(X)|| +(iV+ ||6(Z)||)||e,(7)|| 

so that 

lim ||A(X)||/||X||=0 
x-*o 

which yields g °f differentiable at p and D(g o f)(p) = T o S. 

Examples (3) Let B : V x x V t -> R be a bilinear form and let 
U=(- 1, 1 )c 

For fixed Y e V u let 

/: U-*V X :t->A(t)Y. 

where A(t) eEnd(F 1 ) is such that A : t ->A(t) is dilferentiable on ( — 1, 1) 
and ,4(0) = L Thus A(t) has a matrix (a 2 /0) such that a iy (0) = <5 i7 and the 
a u are differentiable functions on ( — 1, 1) to R. This gives a map 

g : U-+R: t -*■ f(t )) = B(A{t) 7, A(t)Y), 

and for p — 0 e U and X = 1 e V in the formula for the product rule we have 

[A7(0)](l) = [D{B(f, /)}(0)](1) 

= /(0)) + B(f( 0), /'(0)1) 

= 5(<4'(0)7, 7) + 5(7, T'(0) 7), 


2. THE DERIVATIVE 


13 


where, with the notation da tj ldt{ 0) = «y(0), A'( 0) has the matrix 

(da-Jdtm. 

In particular let A(t) be a 5-isometry; that is, for all Y e 
B(A(t)Y 9 A(t)Y) = B(Y i Y). 

Then since the derivative of a constant function is 0, we obtain 

B(A'(0)Y 9 Y) + B(Y ; A'(0)Y) = 0; 

that is, vt'(O) is skew-symmetric relative to i?. 

(4) Let f: R-*R and g : R^R be differentiable functions such that 
g o f = /rfy, the identity function on R. Then from the chain rule, idy — 
Dg(f(p)) ° Df{p) and applying this to the vector 1 e R we obtain 

1 = Dg(f(p )) • Df(p)(\) = g'(f{pj) ■ f(p) 

which gives the usual formula for the derivative of the inverse function. 

In order to compute a matrix for the derivative Df(p ) of a differentiable 
function f : U-> W we consider a coordinate representation for /. First let 

W = W x x W 2 x • x W k 

be a product of Euclidean spaces. Then / : U-*W yields the “coordinate” 
functions f t : U -> by 

/0») = (/O'). • • •. /O')) 

and we leave the following as an exercise. 

Proposition 1.4 Let U be an open subset of V and let 
/ : U -* W x x • • • x W k 

be given by / = (/ l5 ... ,/^) as above. Then /is differentiable at p e if and 
only if each coordinate map f t is differentiable at p; in this case 

Df(p) = (Df x {p),DUp)). 

In particular for Y u ..., Y m a basis of IF and / = Y.fj^j we have for X e V 
Df(p)(X) = Y [Dfj(p)(X)]Yj. 

j 

Next we use a factorization 

F = Fj x V 2 x ••• x F r 

as a product of Euclidean spaces and consider partial derivatives. Thus let 
x = (jc,, ..., x r ) e V be written in terms of its “coordinates” and for p l9 
..., p._ u pi+h ..., p r fixed we consider the map of an open set U t c F f 

fi : U { —► W : x f -> /(Pi,..., ,..., /? r ). 
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If f t is differentiable at p t e U i9 we call its derivative the ith partial derivative 
of/at 

P = U x X ••• x Ui x ••• x U r 

and denote it by D i f(p). Thus if this partial derivative exists, we have for 
X { e Vi that 

DJfpKXd = lim 7 [/(Pi> ..., Pi + tXi ,. .., p r ) - f(p u ..., Pi ,..., /V)] 
r-*0 t 

= Df(p)( 0 9 ...,X i 9 0 9 ... 9 0). 

In particular for IF = R and F = i?" = i? x • * • x R «-times, we use the familiar 
notation 


df/dXi(p) — Dif(p){e^ = Df(p)(€i\ 

where e { = (0, ..., 1, 0, ..., 0) with 1 in the ith position. However, “ partial 
derivatives ” can be defined relative to any basis of R n . 

Proposition 1.5 Let V = V x x • • * x V r be a product of Euclidean spaces 
and let Ui be an open subset of V- x . Let IF be a Euclidean space and let 

/ : x • x U r W 

be a function differentiable at p = (p u ..., p r ) e U i x ••• x U r . Then each 
partial derivative DJ(p) exists and for X — [X u ..., X r ) e x • * * x F r we 
have 


m 

Df(p)X=^D i f(p)X i . 

i = 1 

Proof We leave the proof to the reader as an exercise but the method 
should be familiar: Either adding and subtracting the same suitable terms 
from f(p + X) — f(p\ or showing that Df(p) exists implies Dj(p) exists, 
then using the linearity of Df(p) on X =^(0, .. X i9 0, ...,0) and the 
corresponding formula for Dif{p)Xi . 

In particular note that if IF = R and V — R x • * * x R with 

X = (x u ...,x„)=Y. x i e i 6 v ’ 

then we have 

Df(p)X = Y D if(p)( x i e i) =Y x i Sf/Sxiip). 

We now put together the above results to obtain a matrix for Df{p). 
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Theorem 1.6 Let V and W be Euclidean spaces with X u ..., X n a 
basis of V and Y u ..., Y m a basis of IF. Let V be an open subset of V and let 

f:U^W 

be a function which is differentiable at p e U and is given by the coordinate 
functions 


m 

/(*> = I fj(x)Yj 
j= i 

where x = £? =1 x t X t e U. Then Df(p) has the matrix 

( Difj(p)(Xi )) 

relative to the basis X u X n of V and Y u ..., Y m of W; this matrix is called 

a Jacobian matrix. In case we choose the basis e lf ..., e„ of V and e l9 ..., e m 
of W, then Df(p) has the matrix 

(dfj/dXi(p)). 

Proof For a typical basis element X ( we shall show 
Df(p)X i = ^[Df J (j>)(X ( )]Y J 

j 

which gives the desired matrix. Recall the convention that a linear trans¬ 
formation T : V -+ W has matrix (a^ relative to the above bases provided 

TiXd-YjajiYj; 

From Proposition 1.4 we have each coordinate function fj is differentiable 
and 

DfipXX,) = X [DfjpXXflYj = I [DJjWXftYj 

j j 

using Proposition 1.5 concerning the partial derivative D t fj{p) evaluated at 


Remarks (1) A more basic proof using the approximation (*) after 
Definition 1.1 is as follows. 


DfmXi) - X Dfj(p)(Xi)Yj = lim - [f( P + tx f ) - f(p) - X DftpX'XdYj] 

j t-o l j 


- lim - X [fj{p + tXd - ftp ) - LflWtXMYj 

t-K) t j 
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(2) The results of Propositions 1.4 and 1.5 can be used to find a Jacobian 
matrix for Df(p) in “ block” form. Thus let V = V t x V 2 and W = W x x W 2 
be factorizations into Euclidean subspaces and let U be open in V. Let the 
function f : U —> W be given by the coordinate functions 

fi : U-tWi and f 2 : U->W 2 . 

Then if / is differentiable at p e U, the linear transformation Df(p) has the 
matrix 


lA/tfp) D 2 f,(p) 

[DJ 2 {p) D 2 f 2 (p)Y 

where the D r f s (p) are represented by appropriate matrices. Note that some 
texts have the transpose of the above matrix for their derivative. This depends 
only on summation conventions and we shall write this in detail when neces¬ 
sary. 

Definition 1.7 Let U be open in V and let / ; U —> IV be differentiable at 
every point p e U. Then / is continuously differentiable on U or of class C 1 on 
U if the map 


U - Hom( V, W):p-> Df{p) 

is continuous. We denote the set of these continuously differentiable functions 
by 

C\U, W) 

and for W - R we use the notation 

C l (U) = C\U, R). 

A straightforward computation shows C\U, W) is a vector space and a 
differentiable function /: U-* W is in 0{U, W) if and only if we have a 
matrix representation (a u (p)) of Df(p) with all a tJ : U-*R :p-* a tJ (p) 
continuous functions. 

Generalizing the results concerning term-by-term differentiation of series 
we have the following result (also see Dieudonne [1960]). 

Proposition 1.8 Let V and W be Euclidean spaces with U open in V 
and let {/„} be a sequence of functions in C l {U, W) and let {£>/„} be the cor¬ 
responding sequence of derivatives. Assume there exists a function/ : U -* W 
so that for all x e U 


f(x) = lim f„{x) 

n~* oo 
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with pointwise convergence and assume that there is a function g : U -* 
Hom(E, W) so that 

Dfn^g 

with uniform convergence on U. Then / is differentiable on U and Dj'(p) = 
g(p) for all p e U. 

Exercises (1) Show the map 

f:GL{V)-+GL(V):T^T- 1 
is differentiable at P and 

Df(P)T = — P~ 1 TP~ l . 

Thus show / e C^GUV), End(E)). 

(2) (i) For T e End(K) show the function 

End(K): t -*■ exp (tT) 

is differentiable at any point p e R. 

(ii) Show the exponential function 

exp : End(E) -*GL(V) :T -> e T 

is of class C 1 on a suitable neighborhood U of 0 in End(K). 

(3) Prove the following version of the mean value theorem. Let U be 
open in V and let f \ U —♦ W be of class C 1 on U. Let p e U and X e V be 
such that the line segment p + tX for 0 < t < 1 is contained in U. Then 

II f(p + X)~ /(/>)|| < sup||2)/(?)||||X||, 

where the sup is taken over all q on the line segment. 

(4) Let B : R 2 x R 2 —> R be a bilinear form given by 

B((x i, x 2 ), (y u y 2 )) = x x y 2 -x 2 y,. 

Relative to the usual basis of R 2 let the following represent endomorphisms 
of R 2 

sw =[i ;]• 

C(t) = [* D(t) = A(at)B(bl)C(ct) 

for a, b, c e R , and t in a suitable interval in R. 
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(i) Show A(t), B(t\ C(t), D{t) are ^-isometries [see example (3), 
Section 1.2]. 

(ii) Compute D'( 0). 


3, Higher Derivatives 

Let U be open in V = R n and let / : U -> W be differentiable on U. Then 
Df : U -> L(V 9 W):p^Df(p) 

is a function defined on U with values in the vector space L(F, W) = 
Hom(K, W). Thus if the appropriate limits exist, we can define the second 
derivative at p e U by 

D 2 f(p) = D[Df](p) e L(V, L(V, W)). 

Next let l}(V, W) denote the space of bilinear maps from V x V into W. 
Then we can identify the elements of L(V, L(V y W)) with those of lI(V, W) 
as follows. Let X , Y e V. Then for B e L(V, L(V, Wj) we see that B(X) e 
L(V, W). Thus [B( .¥)]{ V"') = B'(X. Y) e W is linear in X and ) and therefore 
B' e L 2 (K, W). From this the mapping B -*■ B' is an isomorphism of the above 
spaces but we shall consistently use the above identification. In particular for 
a function / : U -*• W differentiable on U such that B = D 2 f(p) exists for 
p e U, let 


F(p) = Df(p) e L(K, W). 

Then for X, Y e V we have 

D 2 f(p)(X, Y) = [DF(p)(X)]Y. 

Example (1) Let / : U —► R be differentiable on U c V = R n and 
suppose D 2 f(p) exists. Then D 2 f(p) : V x V —> R is a bilinear form and we 
shall now compute its matrix. Let e„ e„ be the usual basis of V = R n 
and let g t (u) = Df{u){e^\ that is, 


g t (u) = DJ(u)(e t ) = (df/dx^u). 

Then F(u) = Df(u) is given by 

F(u)Z = Df(u)Z = £z ( . Dj\u)(e^ = Y. z iffi( u X (*) 
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where Z = £z f e f E ^ Next we note f° r X = Y* x j e j e V that 

Dffi(p)(X) =X X J D 0i(p)( e j) = X X J D J D ifW e i > e A (**) 

j j 

Thus for Y = Xy k e k , 

D 2 f(p)(X, Y) — [DF(p)(X)] Y 

= lim - [F(p + tX)Y-F(p)Y] 

t^O t 

= lim 7 X ydffkip + tX) - g k {p )] 

f-0 t k 

= X >■/< Dg k (p)(X) 

k 

= XX Dj DJ{p)ie k , e,.), 

* > 

using (*) for the third equality and (**) for the fifth equality. Thus with the 
notation 

d 2 f/dXi dxj(p) = £>( Djf(p)(ej, e t ) 

we see that the matrix for the bilinear form D 2 f(p) is the Hessian matrix 

(d 2 fld Xi dx/p)). 

For the general case of a differentiable function / : U -+ W we can find 
a formula for D 2 f(p) (assuming it exists) by putting together the Hessian 
matrices for the coordinate functions. Thus let e l9 ..., e m be the usual basis 
for W = R m . Then writing f{u) = ]T/i(tt)e;, where f x \ U-+R, we have for 
X, Ye V 

D 2 f{p){X, Y) = f D 2 Up)(X , Y)e, 

i= 1 

noting Z ) 2 is linear and D 2 f t (p) exist. 

Exercise (1) Let f \R-+R be given by f{x) — cos lx. Relate the 
“ usual” second derivative of/at p to the second derivative discussed above. 


Definition 1.9 A function / : U -*• W of class C 1 on £/ is of class C 2 on U 
if for all p e U, D 2 f{p) exists and the mapping 

U^L 2 (V, W) : p -+ D 2 f(p) 


is continuous on U . 
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It is easy to see that the set of all functions f : U-*W of class C 2 on U is a 
vector space and when W = R we frequently denote this vector space by C 2 
( U ). Thus / e C 2 (t/) if and only if all the partial derivatives of order less 
than or equal to 2 exist and are continuous on U. In this case it is known that 
we can interchange the order of differentiation and more generally we have the 
following result. (For the proof see Dieudonne [1960] and Lang [1968].) 

Proposition 1.10 Let U be open in V and let / : U -► W be of class C 2 
on U. Then for all p eU and X , Y eV 9 

D 2 f(p)(X, Y) = D 2 f(p)(Y 9 X). 

Thus D 2 f(p) is a symmetric bilinear form on V x V into W. 

Thus we see that if/ e C 2 (U ), then the Hessian matrix 

{d 2 f/dXf dxj(p)) 


is actually a symmetric matrix. 

We now define higher-order derivatives by induction. Thus for/ : U -► W 
and j?e(7we set 

D r f(p) = D[D r ~ 1 f](p) 
if the appropriate limit exists. The rth derivative 

D r f(p)eL(V 9 L(V 9 ... 9 L(V 9 W)...)) 

but, as in the case r — 2, we can identify this with the set of multilinear 
functionals from V r = V x • x V to W which we denote by L r (V, W). 

Definition 1.11 (a) Let U be open in V and let/ : U IF be continuous 
on U. Then / is of class C p on U or a C p function if for all u e U, D r f(u) exists 
and if 

D r f : U -> L r (K W) : u-+ D r f(u) 

is continuous for r — 1, ..., p. We use the notation C P (U) for the functions 
of class C p when W = R, 

(b) A function / : U -► W is of class C 00 on U or infinitely differentiable 
if / is of class C r for all r = 1, 2, .... When W = R we use the notation 
C°°(I/) for the class of infinitely differentiable functions. 

The following summarizes the results on C r -functions and for proofs see 
Dieudonne, [1960] and Lang [1968]. 
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Theorem 1.12 Let U be open in F — R n . 

(a) If/ : £/ -* FT is of class C r on U , then for each /? e t/ we have D r f(p) 
is a symmetric r-multilinear form; that is, 

D r f{p){X u ...,X r ) = DJ{p){X nW ,.... X„ (r) ), 

where X { e F and 7r is a permutation of the numbers 1, ..., r. 

(b) If/ : U -> IF and # : (7 -► IF are of class C r on (7, then so is af + bg, 
where a, b e R and 7) r (tf/ + bg) = aD r f + bD r g; that is, the C r -functions 
form a vector space. Also if W = F, then the pointwise product fg is of class 
C r ; that is, C r (U) is an algebra over R. 

(c) Let f \ U be of class C r . Then for all m, n with m + n = r we 
have 


D n+m f =D n D m f 

on U. That is, for pel /, 

D n+m f(p)(X u ..., Z n+m ) = [^"[^/KpX^i, . - , *■)](*■+!, .... *.+J. 

(d) Let / : U -► F be a function and let e ls ..., be the usual basis of 
F = F". Then / e C r (f/) if and only if all of the partial derivatives 

d q f(p)/8x it ■ ■ ■ dx iq = D, t ■ ■ ■ D iq f(p)(e h ,e iq ) 

for q = 1, ..., r exist for all p e U and are continuous on U . In this case we 
have for X u ..., X r e F, 

D r f{p){X u ..., X r ) = Yx Ul * ■ • x rir d r f(p)ld Xil • • * dx ir9 
where the sum runs over all possible r-tuples i u ... 9 i r of l,...r and 

Exercises (2) Let f : U —> W, where W = W x x * • • x W q and / is given 
by coordinate maps (/, ..., /). Show / is of class (7 on U if and only if 
each/ : (7 -► W t is of class C r on U. In this case show D r f = (/)/,..., 

(3) Let U be open in F and E be open in W. Let/ : U -» E and g : E-+Z, 
where Z is some Euclidean space, both he C'-functions. Then show the 
compositions g o / : *7 -► Z is of class C r on £/. Also show D r (T o /)(/) = 
r ° Z> r /(p) for T sL(W,Z) and p e U. In particular this holds for C°°- 
functions. 

(4) Let/ : U -> W be of class C r on U 9 let A u ..., e F, and let 

F: U~> W : x —► ..., 

Show F is differentiable on £/ and 


DF(p)(X) = DJ(p)(A u ..., A r _ u X) 
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for X e V and p e U. This is the usual way for computing higher derivatives 
in terms of lower derivatives. 

(5) (i) Show that the exponential mapping 

exp : End(V) ->GL(F): X -» 

is of class C 00 on a suitable ball about 0 in the Euclidean space End( V). 
Also show 

[D 2 exp(0)](X, X) = X 2 

and find a formula for the bilinear form [D 2 exp(0)](.X\, X 2 ). Show that 
[D r exp(0)](X,..., X) = X r . 

(ii) More generally, let f be a real power series with radius of 
convergence p. Then what can be said about the C°° nature of the function 
f(T) = Y<*nT n in End(F)? 

(6) Let n be a positive integer and let X e End( V) be fixed. Define 

/„ : GL(V) -*• GL(V): P -*P~ n , 
g„ : End(L) -*• End(F ): P -+P n , 
h : End(K) ^ End(E): P -*PXP. 

Show that : 

(i) [Dg n (P)](X) =l d P k XP"- k ~\ 

k = 0 

00 [Df n (P)](X)= -Y j P k -"XP- k - i , 

k~0 

(iii) [Dh(P)] Y = PXY+ YXP , 

(iv) [D 2 f t (P)](X, T) = P~ 1 XP~ l YP~ l + P~ 1 YP~ 1 XP~ 1 . 


4 . Taylor’s Formula 


We shall now discuss Taylor’s formula for vector-valued functions which, 
when restricted to real-valued functions, gives the usual polynomial approxi¬ 
mation of such functions. Also this is the formula which is used when the 
multiplication of a Lie group is approximated at the identity element by its 
Lie algebra. This is the approximation which allows the analysis of a Lie 
group by algebraic methods. The vector spaces in this section are still real 
Euclidean spaces. 
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Taylor’s formula Let U be open in V and let / : U -*• W be of class C r 
on U. Let p e U and X e V be such that the line segment p M tX for 0 < / < 1 
is in U. Let X (k) = (X,...,X) Ar-times and D k f(p)X (k) = D k f(p)(X, ...,X). 
Then 

f(p + X) = f(p + Df(p)X + X (2 > + • • • + X" + s r (X), 

where the error term satisfies 

lim e r (X)/||*H r = 0 . 
x-*o 

For the proof and other estimates for the error term we refer to Dieu- 
donne [1960] and Lang [1968]. However, this formulation is the most practical 
for our use. 

Definition 1.13 Let V and W be Euclidean spaces with U open in V. 
A function f : U -+ W is (real) analytic on t/if/is of class C 00 on U and if for 
each p e U, there exists an open ball B a U with center p so that for all 
q = p + X in B the series 

Z D r f( P )X (r) 

r = o rl 

converges absolutely in the Euclidean space topology and has value f(q). 
The function f : U-+Wi s analytic at/? in U if/is analytic on some neighbor¬ 
hood of p. We shall denote the set of functions / : U ^ W which are real 
analytic on U by C"(t/, W) or tf(U 9 W); see Dieudonne [1960] for more 
results on analytic functions. 


Examples (1) Let / : R -> R be defined by 


/(*> = 



for x > 0 , 
for x < 0 . 


Then it is easy to see that / e C°°(£) and by induction that D k f( 0) = 0. Thus 
since / is not identically 0 in any neighborhood of 0 , / is not analytic at 0 . 
Otherwise in a suitable open interval U about 0, 

/(*) = 1 ^/( 0 ) ^=0 


for all x e U. Thus the classes of C 00 and analytic functions are definitely 
different. 

(2) Let V , W 9 and Z be Euclidean spaces and let U be an open neighbor¬ 
hood of 0 in V and D an open neighborhood of 0 in W. Let 

f : U -> D and g:D->Z 
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be functions of class C 3 so that /(0) = 0. Assuming /, g and /° g satisfy the 
hypothesis of Taylor’s formula we have the following second-order approxi¬ 
mation for X e V at p = 0. 

(gof)(X)=g(f(X)) 


where 


= g( 0) + Dg(0)(f(X)) + (f(X), f(X)) 


+ e 2 


= g( 0) + Dg{0)\^Df(Q)X + -^ (Q) (X, X)j 
D 2 g{ 0) 


+ 


2! 




Z)/(0)^ + 


p 2 m 

2 ! 


(X, X), Df(0)X + 


p 2 m 

2 ! 


(X 


,*)] 


+ e 2 


= ^(0)+ Dg(0)Df(0)X + iDg(0)[D 2 f(0)(X, X)] 

+ \D 2 g(0)(Df(0)X, Dmx) + e 2 , 


lim e 2 (^)/||2C|| 2 =0. 
x-o 

We shall use this later in an example on the matrix group GL(V) in Section 

1 . 6 . 

It is easy to see that if a function f : U-> W is represented in terms of 
coordinates, then/^is analytic on U if and only if each coordinate function is a 
(real-valued) analytic function on U. Thus if for each x 6 U and a basis 
Y u Y m of W we have 


fix) = £ f } {x)Yj, 

j= i 

then/is analytic if and only if for each p e U there exists an open ball B c V 
with center p and m power series Pj,j= 1,..., m, in n variables so that 

fj(q) = P J (q l ~Pi,...,q n -p n ) 

for q e B, where a basis A,, .... X n of V gives a coordinate system in V. 

As an application of this formulation of analyticity we consider the Euc¬ 
lidean space End(K) relative to the sup norm and the exponential mapping 

exp : End(F) ->■ GL(V ) : X -> e x . 

From Section 1.3 we have for fixed a that exp is of class C°° on the open set 
U={Xe End(K) : ||*|| < a} and since 


[/)“ exp(0)]X w = X k 
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we have the rth order Taylor’s formula approximation is 

exp X = / + X + J X 2 + • • • + X r + e r (X), 
2 r\ 


where 


£,(20= £ X k /kl 

k = r+ 1 

We shall now show that the exponential series converges absolutely. First 
we note from example (1) of Section 1.1 by induction 

l|X fc || < ||Xl| fc 

and that the real series £(||X||*/&D ( = ^ llx|1 ) converges. Thus since 

we have that the exponential series is absolutely convergent. If we restrict 
ourselves to a closed ball of radius a 

B(0~a) = {Xe End(F): ||X|| < a\ 

then since the series I cf/nl is convergent, we see from the Weierstrass test for 
series that the exponential series is uniformly convergent on compact sub¬ 
sets of End(F). 

Now the exponential function is analytic on the open ball B( 0, a) which is 
contained in the compact ball of radius a. By choosing the usual basis of 
End(F) we determine coordinates x iu ..., x nn and represent any XeU 
by the matrix Next we observe that each coordinate function for the 
partial sum of the series ! is a polynomial in x ll9 ... 9 x m . Thus for 

each p g B( 0, a) we can find a suitable ball B in B( 0, a) so that each of these 
polynomials is of the form Pij(q n -Piu ...» q nn — P nn ) on & Therefore by 
the uniform convergence, each coordinate function for the series has a power 
series representation 

Pi 1» •••) Qnn Pnn) 

about p. 


Proposition 1.14 

mapping 


(a) Let V be a Euclidean space. Then the exponential 
exp : End(F) GL{V) \X^e x 


is analytic on End(F). 
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(b) If S, T e End(F) are such that ST = TS, then 

exp (T + S) = exp T exp S. 

(c) For a fixed S’e End(F), the map 

4 >•: R~* GL(V ) : t -> exp tS 

is an analytic homomorphism of the additive group of R into the multipli¬ 
cative group GL(V). In particular (exp S) n = exp(«S) for any integer n. 

Proof First note that the vector space End(F) x End(F) is a Euclidean 
space and in terms of coordinates the multiplication function /( of endo- 
morphisms is an analytic map 

H : End(F) x End(F) -*• End(F) : (S, T) -> S ° T 

where = g(S, T ). Now the curves 

.fit) = exp(E + tS) and g(t) ~ exp T exp tS 

are defined for all t e R . These are analytic at t = 0 because the multiplication 
of endomorphisms is analytic and exp is analytic on any ball B(0 , a) as 
discussed above. Now we use some results from elementary differential 
equations (which we shall consider in Chapter 2) and note that since ST =TS 
we have (exp S)T ~ T (exp S ). Consequently f and g are solutions to the 
differential equation 

dY(t)/dt = SY(t) with 7(0) = exp T. 

However, by the uniqueness of solutions we have fit) = g{t) in an interval 
about 0. Since/(r) and g(i) are defined for all t e R and analytic functions of 
t at 0, they are equal for all t e R. Thus for t= 1 we obtain (b). 

For s, t e R note that the endomorphisms sS , tS commute and therefore 
from (b) we see 

<£(s + t) — exp(.y + t)S — exp sS exp tS = cj)(s)<f)(t). 

We have shown that exp is analytic on any ball B{ 0, a). Thus exp is analytic 
at any p e End(V) by choosing the ball i?(0, a) above to contain p. 

Exercises (1) (i) Let U be open in V and D be open in PFand let 
/ : U-+ W and g : D^Z 

be analytic functions so that/(£/)<= D. Then show the composition go f 
is analytic on U. 
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(ii) Let U be open in V and suppose f, g e R ), the set of real¬ 

valued analytic functions on U. Show that R) is an algebra of functions 
relative to the pointwise product. 

(2) Show that the Taylor’s series representation for an analytic function 
is unique. That is, if/ : U -► W is analytic and if for p e U and X e V satisfying 
the analyticity conditions we have for all t with 0 < t < 1 that 


f(p + tX)=f j a k {p){tX) (k) = f b k (p)(tx )<*> 

*=0 *=0 

with absolute convergence, then show a k (p) = b k (p) as multilinear forms. 

(3) (i) In more detail, why is the map 

< t>:R-+GL(V):t-+exptS 


used in Proposition 1.14 analytic? 

(ii) Show that the functions / and g used in the proof of Proposition 
1.14 actually satisfy the indicated differential equation. 

(4) Let X e L n (V, W) be a multilinear map from V" to W and let P = 
(Pi, ..., P n ) and X = (X t , ...,X n )be in V". 

(i) Show lDX(P)](X ) = X(X t ,P 2 .P„) + • • • + A(P,, P 2 , ..., 

Pn-uXnl 

(ii) Find [D n X(P)]X (n \ 

(iii) Find the Taylor’s series expansion for X about 
P = ( 0 ,..., 0 ). 

(5) For a fixed basis of V - R n let A e End(F) be represented by the 
matrix a = (ci;j) so that a can be considered as an element of R n x • • • x R n 
by viewing the rows of a as elements of R”. Consequently the determinant 
function det : End(F) —► R can be considered in terms of the coordinates 
given by the above matrix representation. 

(i) Show det: End(F) -> R is differentiable (of what class?). 

(ii) In terms of matrices, let At, ..., a„ e R n denote the rows of a and 
Xj, ..., x„ denote the rows of any n x n matrix X. Show 


[D det(a)]X = £ det 


(6) There exist differentiable “bump functions” which vanish outside a 
compact set as follows. Show if P(0, a) and £(0, b) with a<b are two 
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concentric balls about 0 e R m , then there exists a C°°-function <j>: R m -+R 
such that 0 < <f>(x) < 1 for all x e R m and 


m = 


if q e B( 0, a), 

if qtB(0,b). 


{Hint: Use the function / of example (1), Section 1.4 and consider 
4>(x) - f(b 2 - \\x\\ 2 )J[f(b 2 - \\x\\ 2 ) + f(\\x\\ 2 - a 2 )]). 


5. Inverse Function Theorem 


We continue the notation of the preceding sections and investigate when a 
function /: U -*W has a local inverse and the various consequences. The 
vector spaces are still real Euclidean spaces. 

Definition 1.15 Let /: U -* W be of class C r on U and let p e U. Then / 
is locally invertible of class C r at p if there exists an open subset U 1 of U with 
p sU x such that there exists an open set D of W with f(p) e D and a function 
g : D -* U x of class C r on D such that g of and fog are the identity maps on 
U x and D, respectively. The function g is called a local inverse of / at/(/?). 

With this notation we have the following (see Dieudonne [1960], Lang 
[1968], and Spivak [1965]). 

Inverse Function Theorem Let U be open in F, let p e U, and let /: U W 
be of class C r on U. If Df(p) e Hom(F, W) is an invertible linear transform¬ 
ation, then / is locally invertible of class C r at p. Furthermore if g is a local 
inverse of/and q = f(p\ then Dg(q) = [Df{p)]~K 

Remarks (1) Since the vector spaces V and W are finite-dimensional, 
Df(p) being invertible yields dim V — dim W, 

(2) Let / : U -* D be continuous with a continuous inverse g : D -+ U 
and let U 1 be an open subset of U . Then f(U x ) is an open subset of D. Thus 
in particular if / : U -*W is of class C 1 and Df(p) is invertible, then for a 
suitable open subset U l of U with p e U u we have /({/) is open in W and 
f-Ui->f(Ui) is a homeomorphism. 

(3) A C 00 -function can have a continuous inverse without the inverse 
being differentiable. Thus the function / : R -► R : x -> x 3 is of class C 00 on R 
and has continuous inverse g : R^R: y-+y x/2 . However, g is not differenti¬ 
able at the point 0 (why?). 
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(4) The various results concerning the inverse function theorem also 
apply to C 00 and analytic functions. 

The inverse function theorem is used to obtain the following result. 

Theorem 1.16 (a) Let V u V 2 , W be real Euclidean vector spaces, 
let U be open in V i x V 2 and let / : U -> W be of class C r on U. If for (a, b) 
e U c V x x V 2 the linear transformation D 2 f(a , b) : V 2 -+ W is invertible, 
then dim V 2 = dim W and the map 

F: U-*V, x W:(x, y)-+(x, fix, y)) 

is locally invertible of class C r at (a, b). 

(b) (Implicit function theorem) Let U be open in V x x V 2 and let 
/ : U ^ W be a map of class C r such that for a given ( a , b) e U,f{a, b) = 0. 
If D 2 f{a , b) : V 2 -+ W is invertible, then there exists an open ball B in V 1 
with center at a e V u and there exists a uniquely determined map g : B^V 2 
of class C r on B such that 

0(a) = b and f(x, g(x)) = 0 

for all x e B. 

Proof Briefly for part (a) we note that since D 2 f(a , b) is invertible, 
dim V 2 = dim W. Thus writing F in coordinates, we have F(x, >>) = 
(F^Xy y), F 2 (x, y)) where F^x, y) = x and F 2 (x> y) — f(x, y). Consequently 
DF(a> b) has a matrix given by 

[ D^iayb) D 2 F 1 (a>b)\ _ T / 0 ] 

[D { F 2 (gy b) D 2 F 2 (a,b)\ [DJ&b) DJia^b)\ 

which is invertible. Now apply the inverse function theorem. 

For part (b) let 

h:U-+V,x W:(xyy)^{xyf(xyy)\ 

Then by part (a), h has a local inverse denoted by H. Thus since H is a map¬ 
ping into V x x V 2 , it can be represented by coordinate functions H = ( H U H 2 ) 
such that 

H(x, z) = (H,(x, z ), H 2 (x, z))= (x, H 2 (x, z)). 

Then the desired function is given by g(x) = H 2 (x, 0). To see that y — g(x) 
satisfies the equation, we have for 2 = 0 that 

(x, 0) = (h o H)(x , 0) 

= h(x, H 2 (x, 0)) 

= h(x, g(x)) = (x, f(x, g(x))). 
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To see that g(a) = b , we note 

(a, g(aj) = (a, H 2 (a, 0)) 

= (a, H 2 (a, f(a, b))) 

= (H° h)(a, b) = (a, b). 

The rest of the proof is not difficult and can be found in the work of Lang 
[1968] and Spivak [1965]. 

Example (1) The surface given by z = /(x, >>) = x 2 + y 2 — 1 defines 
a circle for those points ( x , y) with /(x, y) = 0. Choose ( a , 6) on the circle 
with b ^ 0. Then <9/(a, = 2b ^ 0 so that we can solve locally for y in 

terms of x. 

A variation of the preceding proof gives the following result which we shall 
use in Chapter 2. 

Proposition 1.17 Let U be an open set in R n which contains the point p 
and let / : U^>R q be of class C r on U where q < n. If f(p) = 0 and Df(p) 
has rank q 9 then there exist an open subset D of R n containing p and a locally 
invertible function g : D U of class C r such that for all (x : , ..., x„) e D , 

(/° g){x i, •••,*„) = (x„- q +u ■■■,x n ). 

Thus one can modify the coordinates of the point p by the open set D and the 
function g to obtain a simple expression for / locally. 

Proof We regard R n = R n ~ q x R q and U as an open set in R n ~ q x R q . 
Thus p = (a, b) e U a R n ~ q x R q is such that regarding/as a function of two 
variables we have /(a, b ) = /(p) = 0. Since Z)/(p) is of rank q we see D 2 f(a, b ) 
is of rank q (why?); that is, D 2 f(a , £): i? 9 is invertible. Thus we are in 
the situation of the preceding theorem as follows. If we let x = (x l5 ..., x n - q ), 
z = (x n _ ?+1 , ..., x„), and #(x 1? ..., x n ) = H(x , z) as in the above proof, we 
have for n : (x, z)-*z that 

(/ o £)(x, z) = (/oi/)(x,z) 

= i/)(x, z) 

= (7t o id[y)(x, z) = z. 

This also uses the notation of the preceding proof h(x , >>) = (x,/(x, .y)) so that 
f = n ° h, where H is the local inverse of h . 

Definition 1.18 Let be open subsets of the Euclidean spaces V t for 
i = l s 2. Then the map/: U x -> U 2 is a diffeomorphism of class C7 of onto 
Z7 2 if/is a homeomorphism of t/j onto U 2 so that /is of class C r on U 1 and 
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/ 1 is of class C on U 2 . We analogously define analytic diffeomorphisms and 
say / is a diffeomorphism iff is a diffeomorphism of class C°°. 

We continue our previous notation for the exponential mapping and have 
the following result. 

Proposition 1.19 There exists an open neighborhood U 0 of 0 in End(F) 
and an open neighborhood U 2 of I in GL(V) such that exp : U 0 -► U x is a 
dilfeomorphism of U 0 onto Uj. Furthermore exp and its inverse are actually 
analytic. 

Proof From the inverse function theorem and previous results, it suf¬ 
fices to show that D exp(O) is invertible. Thus using the series expansion, we 
have for A e End(F) that 

[D exp(0)](/l) = lim - [exp(0 + tA) — exp(O)] 
t^o t 

= lim - [7 + tA + s(t 2 ) — I] = A 
t ->o t 

using lim f _ 0 e(t 2 )/t = 0. Thus D exp(0) is the identity. 

Remarks (5) For any X e End(F) we have tX e U 0 for / el? in a 
suitable interval TV of 0 in R . Thus exp maps this line segment tX with t e N 
into an analytic curve segment <j>(t) = exp tX in GL(V) which passes through 
/; that is, exp(0) = / and the “tangent vector” to this curve at t = 0 is X. 

( 6 ) By considering the power series 

log(l +z) = Z“^- + *" + (- 1)” +1 — + ••• 

2 n 

and the proposition in Section 1.1 we have 

log(/+^) = ^-^ + --- + (-l) n+1 —+ ••• 

2 n 

converges when the absolute value of the characteristic roots of A is less than 
1. Now if we use the usual sup norm for T e End(F), we have 

II711 = sup \\TX\\/\\X\\ 

X±Q 

> maxdrj,.... |r„|}, 
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where r u ..., r„ are the characteristic roots of T and || || on the right side of 
the first equation denotes length in V. Thus choosing U 0 <=. {A e Hom( V, V ): 
Mil < 1} and Uj <= {B e Hom(F, V ) : ||5 — 7|| < 1}, we can show 


exp : U 0 ->Uj 

and 

log :Uj^U 0 

log exp A = A 

and 

exp log B = B. 


Thus log is the local inverse of exp and this inverse is analytic on U,. 

Exercises (1) Find two different 2x2 real matrices A and B such that 
exp A = exp B; that is, exp is not globally injective. 

(2) Use Taylor’s formula or the mean value theorem to prove a variation 
of the inverse function theorem. 


6. The Algebra gl{V) 

We shall now use many of the preceding results on the function exp to 
attach a nonassociative algebra gl(V) to the general linear group GL(V). 
The relationship between the group GL(V) and the algebra gl( V) is the basic 
model for studying a Lie group by its Lie algebra and we break down the 
analysis as follows. 

(1) The map exp is used to show that the global multiplication in GL(V) 
induces a local analytic multiplication in End(F). 

(2) We use the second derivative of the local multiplication in End(K) to 
obtain a bilinear multiplication r on End(F) which together yield the algebra 

(3) The properties of GL(V) are used to obtain a formula for t and the 
identities it satisfies. 

(4) Finally we show how automorphisms of GL(V) induce automorphisms 
of gl( V) and indicate some important formulas. 

For the first step let 

g : GL(V ) x GL(V) - GL(V ) : (x, y) g(x, y)~ xy 

be the multiplication in GL(V). Then since matrix multiplication is given in 
terms of polynomials of degree 2, the multiplication g is analytic. In particular 
by the continuity of g we have for any neighborhood U of I e GL(V) that 
there exists a neighborhood D of / so that 


D <= U and g(D , D) c U. 
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However, using the continuity of the map exp : End(F) -► GL{V) we can 
find for any neighborhood D of / e GL(V) a neighborhood E of 0 e End(K) 
so that 

exp(£) c= D. 

Now we choose the above neighborhoods according to Proposition 1.19 as 
follows: Let U cz Uj so that E c: U 0 and exp(is) c D c= U I . Thus 

/i(exp (E), exp(£)) <= Uj. 

This means, using exp U 0 => U It that we can define a function analytic on 
ExE 

F: E x E -* U 0 : (X, Y)^F(X, Y) 


so that for X, Ye E, 

/z(exp X, exp 7) = exp F(X, 7). 


From the remarks following Proposition 1.19 we see 
F(X, 7) = log Mexp X, exp 7). 

Thus the multiplication n in GL{V) induces a local analytic multiplication F 
in End(F); that is, we have neighborhoods t / 0 and E of 0 in End(F) so that 
E <=. U 0 and a “ multiplication ” F: E x E -*• U 0 . 

A variation of the above is as follows. Let X and 7 be any (fixed) elements 
of End(F). Then there exists a neighborhood N of 0 in R so that for all 
s, t e N we have sX, tY e E. Consequently we have the formula 

/;(exp sX, exp tY) = exp F(sX, 1 7) 

which gives an analytic function 

N x N-> U 0 : (s, t) F(sX, tY). 

Thus since F is analytic on E x E we have the Taylor’s series expansion 
about 6 = (0,0) eE x E as follows. Let Z = (sX, tY) be as above. Then 
9 + Z = (sX, tY) = Z so that 

£)^F(0} 

F(Z) = F(0) + DF(6)Z + Z (2) + ”‘- 


In particular for s = t we obtain 


F(sX, sY) = F(6) + sDF(9)(X, Y) + s 2 


D 2 F(6) 


((X, 7), (X, 7)) + • • •. 


2! 
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For the second step we now compute the first few terms of this Taylor’s 
series. From exp 0 = / we obtain 

exp F( 0 , 0 ) = /i(exp 0 , exp 0 ) 

= I) = / = exp 0 

and since exp is injective on U 0 (that is, applying log) we have 

F(6) = F( 0, 0) = 0. 

Similarly since exp 0 is the identity element in GL(V ) we have 
exp sX = /*(exp sX, exp 0) = exp F(sX , 0) 

so that 

sX = F(sX> 0) 

00 D k F(6\ 

= sDF(9)(X, 0) + £ --i- ; (X, 0)<» 

4 = 2 A'.' 

Thus by comparing degrees (or differentiating term-by-term at j = 0) we 
obtain 

DF(0)(X, 0) = X and D k F(0)(X, 0) w = 0 for k > 2. 

Also we obtain 

DF(9)(0, Y) = Y and D k F(9)( 0, = 0 for k> 2 

so that we have the following two formulas. 

DF(9)Z = DF(0)(sX, tY) 

= DF(9)[(sX,0) + (0,fY)] 

= sDF(9)(X, 0) + tDF(9){ 0, 7) = sX + t Y 

and 

D 2 F(9)Z (2) = D 2 F(9)[(sX, 0) + (0, tY), (sX, 0) + (0, tY)] 

= D 1 F(9){sX , 0) (2 > + D 2 F(9)(0, tY) (2 > + 2D 2 F(0)[(sX, 0), (0, t7)] 
= 2 1?? D 2 T(0)[(7, 0), (0, 7)], 

where the last two equalities use the symmetric bilinearity of D 2 F(6) on 
End(F) x End(F) and D 2 F(9)(X, 0) (2) = D 2 F(9){ 0, 7 )< 2 > = 0. 

From D k F(9)(X, 0) (k> = B k F(0)(O, Y) a> = 0 we see that in the Taylor’s 
series expansion for F(sX, tY) the coefficients of s k and t k are 0. Consequently 
the error term 


£ 2 ( 57 , r7) = st(j>(s, t). 



6. THE ALGEBRA gl(V) 


35 


where <j)(s , /) ->0 as .v -> 0 and t -> 0. Thus Taylor’s formula for the local 
multiplication becomes 

F(sX, tY) = sX+ tY+st t(X, Y) + st<j)(s, t ), 

where 

r(X, Y) = D 2 F(0)[(X, 0), (0, Y)] = iD 2 F(0)(X, Y) (2) . 

Next, since the second derivative is a quadratic function defined on the 
whole vector space, we see t(X, Y) is defined for all X, Y e End(F). Also 

x : End(F) x End(F) -^End(F) 

is bilinear as follows. For X, Y, U e End(F) 

x(X + Y,U)= D 2 F(6)[(X + Y, 0), (0, 17)] 

= D 2 F(0)[(X, 0), (0, U)] + D 2 F(9)[(Y, 0), (0, £/)] 

= z{X, U) + x(Y, U). 

Similarly x(t/, X+Y) = x(U, X) + t(U, Y) and for a e R, x(aX, Y) = 
x(X, aY) = ax(X, Y). Thus the vector space End(F) together with the bilinear 
map x : End(F) x End(F) -► End(F) becomes a nonassociative algebra as 
follows. 

Definition 1.20 (a) A nonassociative algebra A over a field K is a vector 
space A together with a bilinear multiplication z e L 2 (A, A); note that “non¬ 
associative ” means not necessarily associative. 

(b) The nonassociative algebra gl(V) with multiplication z as above is 
called the Lie algebra of GL(V). 

From the above formulas we see that the algebra gl(V) determines the 
multiplication in GL(V ) locally up to order two. Thus for s, t in a suitable 
neighborhood of 0 in R we have for all X , Y e gl(V) 

/x(exp sX, exp tY) = exp^^ + t Y + stz{X , 7) + e 2 (sX, ^7)), (*) 

where X + 7 is the addition in gl(V) and z(X, 7) the multiplication. We shall 
show later (Campbell-Hausdorff theorem) that the error term e 2 (JT, 7) is 
actually determined by the subalgebra of gl(V) generated by X and 7. 

For the third step we use various properties of GL(V ) to determine 
identities for gl(V). 

Proposition 1.21 If X 9 7, Z egl(V), then 

(a) z(X, 7) = — t(7, X); 

(b) t(X, x( Y, Z)) + x( Y, x(Z, X)) + x(Z, r(X, Y)) = 0; 

(c) r(X,Y)=i(XY-YX). 
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Proof (a) Since (exp X) 1 = exp( — X) the inverse of the left-hand side 
of (*) above becomes 

exp( - tY) exp( - sX) = exp( - 1Y - sX + stx{ Y, X)+ ■■■) 
and the inverse of the right-hand side of (*) becomes 
exp(-iZ — tY — stz(X, Y) 4 -). 

The required identity now follows from comparing the terms of degree 2. 

(c) The two sides of (*) can be expanded by power series methods using 
the multiplication in End( V) as follows 

exp(s'Z) exp(t Y) = (/ + sX + ±s 2 X 2 + • • •)(/ + t Y + \t 2 Y 2 + • • •) 

= l + sX+tY+ }s 2 X 2 + stXY+}t 2 Y 2 + ■■■, 
exp(s X +tY+ stx(X, Y) + •••) = /+ ( S X + tY + stx(X, Y) + • • •) 

+ HsX+tY+---) 2 + ■■■ 

= I + sX + tY + stx(X, Y) 

+ i(s 2 X 2 + stXY + st YX + t 2 Y 2 ) + • • •, 

where the omitted terms are of degree greater than or equal to 3 in 5 and t. 
Equating the coefficients of st in the two expressions we have 

XY = x{X, Y) + \XY + iY X 

and the desired identity now follows, 

(b) This can be computed directly by substituting (c) into the left side of 
(b) and using the associativity of multiplication in End(F). 

Remarks ( 1 ) Part (c) above could have been proved directly as indicated 
and the property (a) obtained from this. We shall also indicate how to obtain 
(b) from facts on automorphisms and Taylor’s series. 

(2) An abstract Lie algebra over a field K is defined to be a nonassociative 
algebra whose multiplication satisfies the anticommutativity and Jacobi identity 
given, respectively, by (a) and (b) above. 

(3) When just the Lie algebra gl(V) is considered the multiplication is 
usually denoted by 

[X, Y] = XY-YX 

which is called the commutator of X and Y. 

For the fourth part on automorphisms we manipulate more Taylor’s 
series. 

Proposition 1.22 Let/be an analytic diffeomorphism of GL(V ) which is 
also an automorphism of the group GL(V ). Then Df(T) is an automorphism 
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of the Lie algebra gI(V); that is, Df(I) e GL(End(K)) and 
mw, Y ) = *(DfW, D/m) 

for all X, Y e End(F). 

Proof Since/is a differentiable automorphism with differentiable inverse 
/ -1 we have/ -1 °/ = idy, the identity map on End(K). Therefore by the chain 
rule 

idy = D(idy)(I ) 

= D(f~ l o/)(/) = Df-\f(D) o Df(l) 

so that Df(l) is invertible; that is, Df(I) e GL(End(K)). 

Next notice that from the continuity of/ there exists a neighborhood D 0 
of 0 e End(F) so that D 0 c: U 0 and X e D 0 implies/(exp X) e U t . Define 

k : D 0 -*■ U 0 : X -> log/(exp X) 

so that for X e D 0 , 

/(exp X) = exp k(X). 

From this last equation it easily follows that A:(0) = 0 and that 

(Df)(D = (D/)(exp 0) o (D exp)(0) 

= D(fo exp)(0) 

= £>(exp o /<)(()) 

= ( D exp)(/c(0)) o {Dk){ 0) = Dk{ 0) 
since (D exp)(0) is the identity. Thus 

k(0) = 0 and Df(I) = Dk{ 0). 

As earlier in this section define F(X, Y) e End(F) for X, Y in End(K) suf¬ 
ficiently close to 0 so that 

/:(exp X, exp Y) = exp F(X, 7). 

Then for X, Y close enough to 0 we have 

exp k(F(X, Y)) = /(exp F(X, Y)) 

=/(ju(exp X, exp 7)) 

= /x(/(exp 7),/(exp 7» 

= /;(exp k(X), exp k(X)) 

= exp F(k(X), k( Y)) 


so that 


k(F(X, 7)) = F(k{X), k(Y)). 
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Let 9 = (0,0) and Z = (X, Y). Then the Taylor’s series for a composite function 
calculated earlier yeilds 

(k c F)(Z ) = Dk( 0) o DF(9)Z + \Dk{ 0) o D 2 F(9)Z '< 2 > 

+ iD 2 k(0)(DF(9)Z) (2) + e 2 (Z) 

= Dk(0)(X+ Y ) + Dk(0)(z(X , Y)) 

+ iD 2 k(0)(X+ y) (2) + e 2 (Z) 

since DF(9)(Z) = X + Y and D 2 F(9)Z <2) = 2z(X, Y). Similarly F(k(X), k(Y)) 
has a Taylor’s series 

F(k(X), k(Y)) = DF(9)(k(X), k(Yj) + iD 2 F(6)(k(X), k(Y ))< 2 > + ■■■ 

= k(X) + k(Y) + r(k(X), k(Y)) + ■■■ 

= Dk(0)(X) + Dk(0)(Y) + \D 2 k{<S)X w 

+ iD 2 k(0)Y (2) + t(Z)^(0)(Z), Dk(0)(Y)) + s 2 '(Z), 

where e 2 (Z) and e 2 (Z ) in the two expansions involve only terms of degree 
greater than or equal to 3. 

Now if we substitute sX for X and tY for Y in the two series, it is clear that 
the two coefficients of st must be equal. Thus 

Dk(0)z(X, Y) + D 2 k(0)(X, Y) = z(Dk(0)X, Dk(0)Y). 

However, D 2 k(0)(X, 7) is symmetric in X and Y and t is an antisymmetric 
bilinear map so clearly D 2 k(0)(X, Y) = 0 and 

Dk(0)z(X, Y ) = z(Dk(0)X, Dk(0)Y). 

Since Dk(0 ) = Df(I) we have completed our proof. 

Examples (1) Let X' denote the transpose of X e End(F) relative to the 
usual inner product in V=R"; that is, B(X’P, Q ) = B(P, XQ). Then the map 

/: GL(V) -* GL(V) : X (A") -1 

is an automorphism of class C°° with differentiable inverse. Thus Df(I) is an 
automorphism of gl{V ) and is given by 

Df(l) Y = lim - [/(/ + sY) -/(/)] 

.s-^O S 

— lim - [(/' + jT') -1 -/] 

s-»0 S 

= lim - [I - sY‘ + s 2 (Y') 2 +-/] = - Y‘. 

s->0 S 
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(2) Next we consider inner automorphisms 

/: GL(V) -» GL(V) : Y-^ZYZ -1 

for fixed Z e GL(V). Thus/is of class C 00 with differentiable inverse and Df(l) 
is an automorphism of gl(V) given by 

Df(I) Y = lira - [Z(I + sY)Z~ 1 - Z(/)Z _1 ] = ZYZ" 1 . 

$-►0 *5 

Moreover if Z = exp U for suitable V e End(Y), then by multiplying power 
series we obtain 

Df(l) Y = (exp £/) Y(exp - V ) 

= Y+UY-YU + ±U 2 Y- UYU+iYU 2 + ••• 

= Y+[U,Y] + i[U 9 [U,Y]]+---. 

We shall use the notation 

ad U:gl(V)^gl(V): Y->[U, Y] 

and observing that 

e ad 17 7 = [/ + ad {/ +i(ad U) 2 + • •] 7 = 7+[*7, 7] + i[*7, [£/, 7]] + — 
we indicate below how to show 

Df(I) = e ad t/ . 


Exercises (1) Prove the Jacobi identity [Proposition 1.21(b)] for gl(V) 
using inner automorphisms of GL(V ) possibly as follows: 

(i) For 2t(X, 7) = [ X , 7] the above formula for Z = exp tU becomes 
Df(I) 7=7+ 2 tx{U 9 7) + s(t 2 ), where e(t 2 )/t -+ 0 as t -► 0. Apply this replac¬ 
ing 7 by t( Jf, 7). 

(ii) Similarly compute r(Df(I)X, Df(I)Y) and use part (i) with the fact 
that Df(I) is an automorphism of the multiplication r. 

(2) Let Z = e u and let /= f(U) : 7 -+ZYZ~ X as in example (2) above. 
To show Df(l) = e ad v one can proceed as follows. Let <x(t) = e * ad v Y for t e R 
and for 7 e GL(V ). Then note that a(0) = 7 and oc'(t) = ad U(oc(t)). Similarly 
letting P(t) = D[f{tU)){I)Y , note that p(0) = Y and pit) = e tu Ye~ tU . Conse¬ 
quently P'(t) = ad U(p(t)) and using the uniqueness results for differential 
equations we obtain Df(l ) = <? ad 





CHAPTER 2 


MANIFOLDS 


A manifold is a topological space where some neighborhood of a point looks 
like an open set in a Euclidean space. Thus we are able to translate the cal¬ 
culus of the preceding chapter to this type of a space and we develop the 
formalism in this chapter. In the first few sections we consider differentiable 
structures, the definition of a manifold, and real-valued differentiable func¬ 
tions defined on a manifold. Next we consider submanifolds and how they 
arise from the inverse function theorem; we give many examples of sub¬ 
manifolds which are subgroups of GL(n , R). The derivative is generalized to 
a tangent at a point p in a manifold M and then the vector space spanned by 
these tangents generalizes the tangent plane of a surface. As the point p varies 
over M we obtain a variable tangent vector which is formalized via vector 
fields. We give many examples concerning GL(n, R ) which will be abstracted 
in later chapters; in particular we consider the invariant vector fields on 
GL(n , R) and their integral curves. 

Most of this chapter is used in the rest of the book and the reader who 
knows this material need only look at the examples. However, if one is un¬ 
familiar with manifolds it might be best to read through Section 2.3, then read 
Chapters 3 and 4 for applications before finishing this chapter. The reader 
should note that we are assuming a neighborhood of a point is an open set in 
the space. 
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1. Differentiable Structures 


We now extend the basic concepts of Euclidean space to a topological space 
which locally looks like Euclidean space via suitable choices of “ coordinates.” 

Definition 2.1 (a) Let V = R m and let X x ,..., X m be a basis of V so that 
we can represent any point p = £ p k X k e V uniquely. Relative tothis basis, we 
define coordinate functions u t for / = 1, ..., m on R m by 

u i :R m -+R-.Yj> k X k ^p i . 

We shall frequently use the usual orthonormal basis e 1 ,..., e m to obtain the 
usual coordinates u t given by u % {a x , ..., a m ) = a t . 

(b) Let M be a topological space and let p e Af. An m-dimensional chart 
at p e M is a pair ( U , x), where U is an open neighborhood of p and x is a 
homeomorphism of U onto an open set in R m . The coordinates of the chart 
(U, x) are the functions x t for i = 1,..., m given by 

x i = u i ox: U^>R:q~* x t (q), 

where x { (q) = u £ (jc(g)) and the u x are coordinates in R m . We frequently write 
x = (x t , ..., x m ). The set U is called a coordinate neighborhood and (U, x) is 
called a coordinate system at pe M. 

Definition 2.2 An m-dimensional topological manifold M is a Hausdorff 
space with a countable basis such that for every p e M there exists an m- 
dimensional chart at p. In this case we say that the dimension of M is m. 

Thus in particular we can find a covering of M by open sets and each open 
set U in the covering is homeomorphic to the open m-ball B m — {a e R m : 

Ml < !}• 

Examples (1) Any open subset N of R m is a manifold of dimension m, 
since N itself is a coordinate neighborhood of each of its points and the 
identity map x is such that (N, x) is an m-dimensional chart. Thus for V = R n 
we have GL(V) a R" 2 is a manifold of dimension n 2 . Note that for a fixed 
basis in V, any linear transformation A e GL(V) has a unique matrix repre¬ 
sentation (fly) and coordinate functions x tJ can be defined by x u (A) = a tj . 

More generally, if N is an open subset of a manifold M, then N becomes 
a manifold by restricting the topology and charts of M to A, and N is called 
an open submanifold of M. 

(2) The unit circle M = S 1 = {a e R 2 : ||fl|| = 1} with the topology in¬ 
duced from R 2 is a one-dimensional manifold, and the collection of open sets 
which covers S l can be taken to have two elements. More generally, we shall 
show later that the n-sphere S n ={ae R” +1 : ||#|| = 1} is an ^-dimensional 
manifold, and the collection of charts can be taken to have two elements. 
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(3) The closed interval M = [0,1] is not a one-dimensional manifold 
since the point 0 is not contained in an open set U a M which is homeo- 
morphic to an open set in R. Is the loop M as indicated in Fig. 2.1 a manifold? 
Thus is the point of intersection contained in an open set U c= M which is 
homeomorphic to an open set in Rl 



The coordinate functions given for the manifold GL(V) are differentiable 
of class C 00 (actually analytic), and we now define such notions in general. 

Definition 2.3 A set s# of (ra-dimensional) charts of an m-dimensional 
manifold Mis called a C°°~atlas if stf satisfies the following conditions. 

(a) For every p e M, there exists a chart ( U , x ) e stf with p e U\ that is, 
M = \J{U: (U,x)esf}. 

(b) If (U(x), x) and (U(y),y) are in jtf, where U(z) is the coordinate 
neighborhood corresponding to the homeomorphism z, then U(x) n U(y) is 
empty or the maps x o y~ x and y ° x~~ l are of class C°°. 

Note that x oy~ l (respectively y ° x~ l ) has domain y{U(x) n U(y)) 
[respectively x{U{x) n C/(y))] and transforms these subsets of R m homeomor- 
phically onto each other (Fig. 2.2). Thus since one of these maps is the inverse 
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of the other, their derivatives are invertible linear transformations on R m , 
using the chain rule. These maps are called a change of coordinates and one 
says that the corresponding coordinate ( U(x ), x) and {U(y),y) systems of p 
are compatible when they satisfy condition (b). This will eventually lead to 
the fact that if a function f:M-+R is differentiable in one coordinate system, 
then /is differentiable in any compatible coordinate system. 

Definition 2.4 Let srf be a C°°-atIas on an m-dimensional manifold M. 
Then a chart (U, x ) is admissible to $0 or compatible with 3 if (U, x) is com¬ 
patible with every chart in 3 ; that is, for any ( U(y ), y) e 3, we have ( U , x) 
and (£/(y), y) satisfy condition (b) in Definition 2.3. 

Now given any atlas 3, one can adjoin all charts which are admissible to 
3 and obtain a collection 3 which is again an atlas on M. Thus 3 is maximal 
relative to properties (a) and (b) of Definition 2.3, and any atlas is contained 
in a unique maximal atlas. 

Definition 2.5 (a) An m-dimensional topological manifold M has a 
C^-differentiable structure or just a C°°-structure if one gives M a maximal 
C°°-atlas. Thus to give a C 00 -differentiable structure, one need only exhibit a 
C°°-atlas on M, then consider the maximal atlas containing it. 

(b) A differentiable manifold of class C 00 or just a C°°-manifold is an m- 
dimensional topological manifold M to which there is assigned a C°°-dif- 
ferentiable structure. 

Remarks (1) One obtains differentiable manifolds of class C k , k > 0, 
or real analytic manifolds by just demanding that the change of coordinates 
y o x~ 1 and x ° y~ l given in Definition 2.3(b) is of class C k or analytic. 

(2) To define an m-dimensional complex manifold just replace R m in the 
definition of differentiable manifold of class C°° by the m-dimensional complex 
space C m . Condition (b) in Definition 2.3 must be modified by demanding 
that the functions y o x' 1 and x o y~ x be holomorphic in the respective sets 
in C m . 

Examples (4) Let M = R and define a coordinate system (t/(x), x) by 
U(x) = R and x : M -► R : t -> t. Then = {(t/(x), x)} is a C^-atlas which 
defines a differentiable structure and R is a differentiable manifold of class 
C 00 relative to this structure. Now let = R and define a coordinate system 
({/(v), y) by U(y) = R and y: M t -> R : t ->• t 3 . Then = {{U(y), y)} is a 
C'-atlas since U(y) covers M, and the map y ° y~ x , the identity, is of class C°°. 
Thus Definition 2.3 is satisfied. The atlas ,5/ t makes M , into a C°°-manifold. 
The manifolds are distinct in the sense that the charts (U(x), x) and ( U(y), y) 
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on R are not compatible since x ° y~ x : R -+ R : t -+ t x/ 3 is not differentiable 
at t = 0. 

(5) Let S n = {a e R n+i : ||a|| = 1} be the n-sphere with the topology in¬ 
duced from R n+1 and \\a\\ 2 = X?=i a i‘ f° r a = ( a i >• • ■» a n+i) e We 

define a differentiable structure on S n as follows. Let p = (0,..., 0, 1) be the 
“north pole ” and q = (0,..., 0, — 1) be the “south pole” of S n . Then the 
open sets U(p ) = S n — {p} and U(q) — S n — {q} cover S", and we define co¬ 
ordinate functions x and y so that {(U(p), x), (U(q), j)} is an atlas on S n . The 
functions x and y are defined by stereographic projections as follows. For 
a e U(p) let X be the line determined by the points p and a and let n be the 
plane in R n + l given by u n+1 = 0. Then the value x(a) is the point in R n+1 
where X and n intersect. Thus we have a map x: U(p) -> R n (see Fig. 2.3). 



More specifically if a = (a x ,..., a„ +1 ), then x(a) = (x x ,..., x„), where x f = 
aj 1 — a n+1 for Similarly y is given by stereographic projection 

y : U(q) R n : a -> (y t , ..., y n ), where y t = aj 1 + a n+1 for i = 1,..., «. From 
the formulas, the functions x and >> are homeomorphisms onto R n , and the 
formulas show that x °y~ x and y o x _1 are of class C 00 . Thus we obtain an 
atlas which makes S n into a C°°-manifold. 

Note that S n is a special case of manifolds defined by the implicit function 
theorem as follows. Let /: R n+1 -> R be a C°°-function and suppose that on 
the set M ={p e R n+1 : f(p) = 0} we have Df(p) ^0 or more generally 
Ai+i f(p) ^ 0. Then one can apply the implicit function theorem to obtain 
a neighborhood of p e M which projects in a bijective manner onto the plane 
u n+1 = 0 and yields an atlas which makes M into a C 00 n-dimensional mani¬ 
fold. Thus for S n , take f(x t ,..., x„ +1 ) = x t 2 + • • * + x* +l — 1 and note 
Df(p) # 0 for p e S n = f~\0) (just compute Df(p) for p s S n ). 

(6) We now consider the product manifold determined by two C 00 - 
manifolds M and N. Thus let ( U(x ), x) and (F(y), y) be in the maximal atlases 
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for M and N with U(x) [respectively V(y)] a neighborhood of p e M (respec¬ 
tively q e N). Then define an atlas on the topological product space M x N 
by letting U(x) x V(y) be the coordinate neighborhood of (p,q)e M x N and 
define the homeomorphism 

x x y: U(x) x V(y) -> R m x R n : («, v) -► (x(u), j(f)). 

Thus the set of all these charts (£/(*) x V(y), x x y) defines a C^-atlas on 
M x N and the corresponding maximal atlas defines a C°°-differentiable 
structure onMxiV. The product manifold of M and N is the Hausdorff space 
M x N with the C°°-structure as given above. Similarly one can define the 
product of any finite number of differentiable manifolds. 

Next let S 1 be the unit circle with the usual C^-differentiable structure and 
let T” = S l x • • • x S 1 (//-times) be the product manifold. Then T n is called 
an //-dimensional torus. Thus, in particular, since T 2 = (J {{*} x S 1 : x e S*}; 
that is, T 2 is a union of unit circles whose centers are on a unit circle, we 
obtain Fig. 2.4. 



As shown in Fig. 2.5, T 2 can also be represented as a closed square whose 
points on the top edge are identified with those directly below on the bottom 
edge. The points on the right and left edges with the same heights are identi¬ 
fied; in particular, the four vertices are identified as the same point. This 
identification comes from appropriately cutting and bending the above 
diagram for T 2 . Note that since S 1 = {exp 2nix : 0 < x < 1}, we can identify 
T 2 = {(exp 2nix, exp 2niy ): 0 < x < 1 and 0 < y < 1} with [0, 1) x [0, 1) as 
above. 


♦ 


—- 

Fig. 2.5. 
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2. Differentiable Functions 


A mapping /: M -* N of two C°°-manifolds will be seen to be differentiable 
of class C 00 if its ‘‘coordinate expressions” are differentiable. Thus we shall 
reduce the differentiability of / to investigating the differentiability of func¬ 
tions g : R m -> R. 

Definition 2.6 Let M and N be C°°-manifolds of dimension m and n , 
respectively, and let 

be a map defined on a neighborhood of a point p e M. We say that / is 
differentiable at p of class C°° if there exists a coordinate system (£/, x ) at p 
in M and a coordinate system (F, y) at/(p) in N such that 

yofo X -': X (U)^y(V) 

is differentiable at x(p) of class C 00 (see Fig. 2.6). Note that x(U) ci R m and 
y(V) C /r 




Since differentiability is given in terms of specific charts, we must show 
that it is actually independent of the choice of charts. Thus let (E7, x ) [re¬ 
spectively (F, y)] be any other elements of the atlas for M (respectively N) 
which are neighborhoods of p [respectively/^?)]. Then we must show the map 

y ofo x" 1 ; x(U ) -> y(V) 

is differentiable at x(p). However, since differentiability is a local property, 
it suffices to show this on neighborhoods. Thus for U n U and V n V (which 
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are nonempty), we have on the neighborhoods x(U n l/) and y(V n F) that 
yofox- 1 =yoy- 1 o(yofo x - 1 )oxox- 1 . 

Thus since y °y -1 , y °/° x _1 and x ° x -1 are of class C°°, so is their com- 
position. If/: M -► N is C 00 -differentiable at every point pe M, then / is a 
differentiable map of class C 00 from M into N. 

Now in terms of coordinates, it suffices to show that the functions 

fi = Ui o (y ofo x -1 ) :R m ->R 

for / = 1,..., n are differentiable on an open subset D of x(p). Thus if (U, x ) 
and ( V, y) are the corresponding coordinate systems, then we obtain the co¬ 
ordinate expression 

yi=fi(x x m ) for i = 1,m 

which must be differentiable at x(p) — (p x , ..., p m ). This yields the following; 
for example, see Bishop and Goldberg [1968, p. 37]. 

Proposition 2,7 Let /: M -> TV be a continuous mapping of two C 00 - 
manifolds. Then / is of class C 00 on M if and only if for every real-valued 
C°°-function y : V -> defined on an open submanifold K of N, the function 
y of is of class C 00 on the open submanifold f~\V) of M. 

We shall write C°°(M) or F(M) for the set of real-valued C 00 -functions on 
M and C G0 (p) or F(p) for the set of those real-valued functions which are C 00 - 
differentiable at pe M. Note that since differentiability of / at p e M also 
means / is defined on a neighborhood U of p , the elements of C°°(p) are 
actually pairs (/, U). Consequently one can define an equivalence relation for 
elements of C°°(p) such that (f x , L^) ~ (/ 2 ? t/ 2 ) if and only if there exists an 
open set G with peG and f Y {q) = / 2 (^) for all ^ e G. The set of equivalence 
classes are called germs of C°°-differentiable functions at p . Note that the 
coordinate functions x t on U are in C°°(p). We shall usually not use this 
terminology but just the underlying ideas. 

Next note that F — C^iM) is an associative algebra over R with operations 
given by 

(af ){p) = af(p) for a e R, 

(/+ g)(p) =f(p) + g(p) 

(fg)(p) =f(p)ff(p) for f,ge C 0O (Af) 

and F satisfies the following [Helgason, 1962, p. 5]. 

(1) If/i, ... ,/ r e Fand if g : R r -► R is of class C°° on K'*, then ,..., 

fr)eF. 
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(2) If /: M -» R is a function on M such that for each p e M there is a 
^ e F and there is a neighborhood £/ of p such that f(q) = #(#) for all q e [/, 
then/ef 

(3) For each p in the ra-dimensional manifold M, there exist m functions 

F and an open neighborhood U of p such that the mapping 

U —> R m : q ■ ■ ■ ,f m {q)) 

is a homeomorphism of U onto an open subset of R m . The functions fi, 
f m and the set U can be chosen so that for any/ef, there is g : R m -> R of 
class C x and 


/= ff(A ,■■■ ,/ m ) 

on U. 

These properties determine a differentiable structure on M as follows (see 
Helgason [1962, p. 6] for a proof). 

Proposition 2.8 Let M be a topological Hausdorff space and let m be an 
integer greater than 0. Let F be a set of real-valued functions on M satisfying 
properties (l)-(3). Then there exists a unique collection of charts srf ={(U a , 
x a ): a e A} which form a maximal atlas of M such that the set of real-valued 
C°°-functions on the manifold M with atlas st equal the set F. 

Definition 2.9 The C°°-manifolds M and N are diffeomorphic if there 
exists a homeomorphism f\M^N such that / and /" 1 are of class C 00 ; / is 

called a diffeomorphism. 

Thus a diffeomorphism yields an equivalence relation such that the two 
manifolds are not only topologically equivalent, but also they have equivalent 
differentiable structures. 

Examples (1) Let R be a manifold with the usual structure x : R -* R : 
t t and (— 1, 1) be an open submanifold of R. Then 

/:(— L \)-*R:t-*tf(\ - t 2 ) 

is a diffeomorphism. 

(2) Let R be the above manifold with the usual structure, and let M x be 
the manifold with space R and coordinate function y : M x R : t-+ £ 3 . Then 
the map/: M x -> R : s —> 5 3 is a C^-homeomorphism and the inverse homeo¬ 
morphism f~ l : R -► M x : u -> u 1/3 is actually differentiable of class C 00 
relative to the above differentiable structures: For t e R we have the coordin¬ 
ate expression (y o/” 1 ° x~ 1 )(t) = (y °/ _1 )(0 = y(t 1/3 ) = t. However, note 
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that identity map g : M x R : t t is not C 00 since (x ° g ° y x )(t) = t 113 
which is not C 00 , that is, the identity map is not a diffeomorphism. 

Exercise Let A/ be a C 00 -manifold. Show that the charts ( U, x) and (K, y) 
at p g M are compatible if and only if a; and y are C°°-related by y = f{x) and 
x = g(y) for suitable C^-functions / and g. 


3. Submanifolds 


We shall now use the preceding results to study certain substructures of a 
manifold and return to these topics again after studying the differential of a 
function. 

Definition 2.10 Let M and N be C°°-manifolds of dimensions m and n 
respectively, and let/: A/-> N be a C°°-mapping. 

(a) We call / an immersion of M into N if for every p e Af, there is a 
neighborhood U of p in Af and a chart ( V 9 y) of f(p) in N such that if we 
write y = (y u ..., y n ) in terms of coordinate functions, then x t = y. of \U for 
i* = 1, ..., m are coordinate functions on U in Af. That is if x = (x u ..., x m ), 
then (£/, x) is a chart at p in M. We say that M is immersed in N if an im¬ 
mersion f:M-+N exists. 

(b) We call/an embedding if/is injective and/is an immersion. Also M 
is said to be embedded in N. Thus an immersion is a local embedding. 

(c) We call the subset /(Af) of A a submanifold of N if/is an embedding 
and /(Af) is given a C°°-differential structure for which the mapping of mani¬ 
folds /: Af ->/(Af) is a diffeomorphism. In particular if Af is a subset of the 
C^-manifold N and Af has its own C°°-differentiable structure, then Af is a 
submanifold of N if the inclusion map /: Af -► N : x -* x is an embedding. 
Thus a coordinate system on N induces a coordinate system on Af. 

The subset /(Af) is called an immersed submanifold if the above mapping 
/is just an immersion. The topology of a submanifold Af <= N need not be the 
induced topology of the containing manifold. However, since the inclusion 
map is C°° and consequently continuous, the open sets in the induced topology 
are open sets in the submanifold topology. Also note that the dimension of a 
submanifold is less than or equal to the dimension of its containing manifold 
and in the case of equality we just obtain open submanifolds; this can be 
easily seen by using the inverse function theorem as stated in Section 2.5. 
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(a) (b) (c) 

Fig. 2.7. 

Examples (1) Consider the mappings f:R^R 2 indicated in Fig. 2.7. 
In (a)/is an immersion (why?) but not an embedding and f(R) is an immersed 
submanifold but not a submanifold. In (b) the figure “ 8 ” is such that the 
arrow segments approach but do not touch the center p. Then /is an embed¬ 
ding and /(#) is a submanifold when given the obvious C°°-structure. Note 
that the submanifold topology is that of a bent open interval and therefore a 
neighborhood of p in the submanifold topology is just a bent open interval 
containing p. However, a neighborhood of p in the topology induced from 
R 2 always contains part of the arrow curves near p. Also the spiral in (c) 
yields an embedding and a submanifold. What can be said about the sub¬ 
manifold topology and the topology induced from R 2 in (c)? 

(2) Consider the torus of Section 2.1 

T 2 = {(exp 2nix, exp 2niy) :0 <x < 1 and 0 < y < 1} 

and define 

f:R-*T 2 :t-> (exp 2niat, exp 2nibt) 

where ajb = a is an irrational number. Then / is injective (by solving the re¬ 
sulting equations and using a is irrational) and /is C 00 . Thus by giving f(R) 
the obvious C°°-structure so that f:R-+ f(R) is a diffeomorphism, f(R) is a 
submanifold. Furthermore/#) wraps around T 2 in a nonintersecting manner 
and is actually dense in T 2 (exercise or see the text of Auslander and Mac- 
Kenzie [1963]). Representing T 2 as a square with opposite sides identified as 
discussed in Section 2.1, we see f(R) can be represented by the line segment 
(x, y) = ( at , bt) and their displacements as in Fig. 2.8. Also we should note 



Fig. 2.8. 
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that points close together in T 2 need not be close in f(R); that is, the topology 
in f(R) is not the induced topology. 

Exercises (1) In general, can one find a one-dimensional submanifold 
of T n which is dense in T n ? 

(2) Show if /: M -► N defines a submanifold and if M is compact, then 
/: is a homeomorphism. (Hint: What can be said about a con¬ 

tinuous map of a compact space onto a Hausdorff space?) 

If z = f(x , y) is a well-behaved function, then it defines a surface M a R 3 
which is a two-dimensional submanifold. For a point p e M we can define, 
in a suitable neighborhood V in R 3 of p , coordinates ( x , y, w), where w = z — 
/(*, jO. Thus the surface is given locally by the equation w = 0. The familiar 
upper hemisphere given by z = (1 - x 2 - y 2 ) 1/2 > 0 is an example of such a 
situation. We have the following generalization of this. 

Proposition 2.11 Let M be an ra-dimensional C°°-submanifold of the n- 
dimensional C°°-manifold N and let pe M. Then there exists a coordinate 
system ( V, z) of N with p e V such that: 

(a) z t (p) = * • * = z n (p) = 0 where the z t are the coordinate functions; 

(b) the set W — {r e V : z m+1 (r) = * • • = z„(r ) = 0} together with the 
restriction of z x ,..., z m to IF form a chart of M with pe W. 

Conversely, if a subset M c iV has a manifold structure with a coordinate 
system at each p e M satisfying the above, then M is a submanifold of N. 

proof Let /: Q -► N be an embedding which defines M —f(Q) and let 
P = /(<?) f° r a unique q e Q. Now let (T, be a chart for p in N and we can 
assume y(p) = 0 in Let £/ be a neighborhood of q = /“ 1 (p) in 2 an d let 
x = y of\U be such that ( U , x) is a chart for q in Q. Thus x(q) e R m and for 
i = 1,..., m we have x i =y i of\U are the corresponding coordinate func¬ 
tions. 

Nowthecompositionj> o/o jc' 1 = /^definesaC°°-functionF: x(U)-*y(T), 
where x(U) cz R m and y(T) c R”, and we can write F in terms of coordinates 

y i =f i (x 1 ,...,x m ) for 

The hypotheses M=/(Q) is a submanifold, }'o/=fox, and x = jo/|t/ 
yield = x t for / = 1, ..., m in the above expression for F. Thus the rank of 
DF(x(q)) is m; that is, the m x m matrix (dfjdxj), i,j= 1, m, is the 
identity. By the inverse function theorem there exists a neighborhood D of 
x(q) with D a x(U) where the first m equations can be locally inverted 

= for 1 = 1, 
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where y t ,...,y m are actually the coordinate functions defined on /» 
x~\D) <= Tbut are also used above to denote “coordinates” in y(T). (Note 
that due to the simple expression of the first m equations y, =f i (x l ,x m ), 
the functions g, can be explicitly computed. What are they ?) 

We now change from the y-coordinates to the z = (z 1 . z„) coordinates 

given by 

Zj = yj for j=\,...,m 

Z ( — yi ~ > • • • > I'm)) • • • > ffmiyi > • • • > .^m)) 

for i = m + 1,..., n. These equations for y’s are defined on/« x -1 (Z)) e T 
and are C x . They form a change of coordinates because z -1 exists locally 
[show det(dzj/5y/p)) # 0] and y ° z" 1 and z ° y -1 are C°° on their domains. 

Let V z>f ° 1 (£>) be the subset of the domain of y in T where z is defined. 

Then Fis a neighborhood of p in N and by unscrambling the definitions of/ 
and gi and using y(p) = 0 we have z(p) = 0. Now the set W in (b) given by 

W = {r e V: z m+1 (r) = • • • = z„(r) = 0} 

contains p and is in M since in terms of the defining equations for z m+1 ,..., z„ 
we see Wcif(x~ 1 (D))<=f(U)cf(Q) = M and also W is open in M. The 
restriction of z t = y t for i = 1,.... m to W equals x t (second paragraph) and 
so are coordinates on W. 

The converse follows from various definitions. 

Remark The above proof contains some machinery which is not neces¬ 
sary in view of our definition of a submanifold and for a more direct proof 
see the book by Bishop and Goldberg [1968, p. 42], However, it can be modi¬ 
fied to obtain the following result which is frequently used as the definition 
of a submanifold [Helgason, 1962; Singer and Thorpe, 1967], 

Corollary 2.12 Let P be an w-dimensional C^-manifold, let N be an n- 
dimensional C°°-manifold with n > m, and let/: P —» N be an injective C°°- 
function. If for every qeP, there exists a chart (U, x) of q in P and there exists 
a chart (T, y) of f(q) = p in N such that the linear transformation 
D(y°fo X -i)(x(q)):R m ^R n 

is injective, then M = f(P) is a submanifold of N provided f(P) is given a 
C°°-structure so that f:P^> f(P) is a diffeomorphism. 

proof We shall use the converse of Proposition 2.11 by showing (a) and 
(b) hold. By a simple translation argument we can assume that x{q) = y(p) — 0. 
Now near x{q) we can represent the composition y °/° a -1 = F in terms of 
coordinates 

y t =/»(*! > • • ■» *m) for i= 
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Since D{y ° x~ i )(x(q)) has rank m we have that some subsystem of m 
equations 

yij=fij(x for 

is such that m x m matrix (df t Jdx k ) is invertible. We can assume this sub¬ 
system consists of the first m equations and consequently can define a function 

F : x(U) -► R m : (x 1 , ..., x m ) -> (y t ,..., y m ) 

where we use the coordinate function to also denote the corresponding point. 
Thus by the inverse function theorem there exists a neighborhood D of x(q) 
and D <= x(U) on which F has a local inverse G. Thus the first m equations 
can be locally inverted 

xi = g£yif->yj for * = 

and we proceed as in the above proof. Note that from the defining equations of 
z we see that x(q) — y(p) = 0 implies z(p ) = 0. 

Remark Let M be an w-dimensional C°°-manifold. Then it can be 
proved that M is diffeomorphic to a submanifold of R n with n<2m+ 1. 
This theorem of Whitney can be found in the work of Auslander and Mac- 
Kenzie [1963]. 

Proposition 2.13 Let M and N be C°°-manifolds of dimension m and n , 
respectively, with m>n. Let /: M -+ N be a C°°-map and for some fixed 
p e N let f~ x (p) ={qe M : f(q) = p}. Let every q e f~ 1 (p) have a chart (U, x) 
in M and let p have a chart (T, y) in N such that D{y °fo x: _1 )(x:(^)) : R m -► R n 
is surjective. Then f~ x (p) is a closed (m — n)-dimensional submanifold of 
M or /~ 1 (p) is empty. 

Proof This follows from the variation of the inverse function theorem 
given in Proposition 1.17 using the inverse image of the set {p} is closed (or 
see the book by Spivak [1965, p. 111]). 

A C°°-map f:M-+N such that for every q e M there is a chart ( U , x) at 
q and a chart (T, y) at f(q) with D(y o/o x~ 1 )(x(q)) surjective is called a sub¬ 
mersion. Thus the injective or surjective nature of D(y o/o x -1 )(;c(/)) deter¬ 
mines submanifolds. 

Exercise (3) If /: R -► R m is C 00 , then show the graph of / given by 
G(f) ={(*,/( 0) : t e R} is a submanifold of R m + 1 = R 1 x R m with the in¬ 
duced topology. Does f\R^R 2 :t-+(t 2 ,t 3 ) define a submanifold? The 
above can be generalized to C°°-functions 
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Definition 2.14 Let M be a C°°-manifold. A C°°-curve in M is a C"°-map 
/from some interval / contained in R into M such that/has an extension/ 
which is a C°°-map of an open interval / => / into M. Thus if / = [a, b\, then 
there exists an e > 0 such that J = (a - s, b + e) and there exists a C°°- 
function/ \J-*M such that f(t) = f(t) for all t e I. Then/is frequently called 
a curve segment in case I = [a, b\. A broken C°°-curve in M is a continuous 
map/: [a, b]-*M together with a partition of [a, b ] such that on the cor¬ 
responding closed subintervals/ is a C°°-curve. 

Examples (3) The map f:R-*R 2 :t~* (t 2 , t 3 ) is a C°°-curve in R 2 with 
a cusp at (0, 0). 

(4) The “wrap around” map on the torus T 2 given in Section 2.3 with 
“irrational slope” is actually a C^-curve which is dense in T 2 . 

(5) The map /: [0, 1] -* R 2 given by 

_ ((*> sin !/0 if ** °> 

J( ) {(0,0) if t = 0 

is not a C°°-curve in R 2 since it does not have a C 00 -extension to an open 
interval containing 0. 

We recall that a topological space M is connected if it satisfies any of the 
following equivalent conditions: 

(1) M is not the union of two nonempty disjoint closed subsets; 

(2) M is not the union of two nonempty disjoint open subsets; 

(3) the only subsets of M which are both open and closed are M and the 
empty set; 

(4) if M = (J a E a , where E a are open and E a n E„ is^empty if a # b, then 
only one of the E a is nonempty; 

(5) if/: M -> N is a continuous map into a discrete set, then/(M) is a 
single point. 

A topological space M is path connected if for every p,qeM, there exists 
a continuous curve/: [a, b] -* M with p =/(«) and q =f(b). We have the fact 
that a path connected space must be connected [Singer and Thorpe, 1967]. 

Proposition 2.15 Let M be a C°°-manifold. 

(a) If M is connected, then every pair of points can be joined by a broken 
C”-curve. 

(b) M is connected if and only if M is path connected. 

Proof Part (b) follows from the preceding remarks and part (a). Thus 
let p e M and for q e M, define q ~ p if and only if q can be joined to p by a 
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broken C°°-curve. Then since ~ is an equivalence relation, M is the union of 
the disjoint equivalence classes 

E p = {qeM :q~p}. 

Now each E p is open in M, for if q e E p , let (U, x) be a chart of M such that 
q e U with x(q) = 0 and x( U) = B m , an open m-ball. Now for each u e U the 
point x(u) e B m can be joined to x{q) by a C 00 -curve A in B m ; that is, X a 
straight line segment. Therefore u e U can be joined to q by the C°°-curve 
x ^ 1 o A and consequently uell can be joined to p by a broken C°°-curve. 
Thus u ~ p, so that U c E p and E p is open. However, since M = (J E q (dis¬ 
joint), we have by condition (4) that all the E q are empty except one. Thus 
every point in M can be joined to p by a broken C 00 -curve. 

Example (6) Let V = R n and let G = GL(V). Then G is an open n 2 - 
dimensional submanifold of R" 2 . Now let 

SL(V) = {Ae GL(V ): det(A) = 1}. 

Then SL(V) is clearly a subgroup of GL{V) and is called the special linear 
group and is sometimes denoted by SL(n, R ). Now SL(V) is a closed sub¬ 
manifold because if we let 

/: GL(V ) -> R — {0} : A -* det(y4), 

then using exercise (5), Section 1.4 for the derivative of det we see that for 
all A e GL(V), D(f)(A) is surjective; that is, of rank 1. Thus by Proposition 
2.13, SZ,(T) =/ 1 (1) is closed and of dimension n 2 — 1. 

We shall now use exp to obtain a coordinate system at the point / e SL(V) 
and then for any point A e SL(V). For g = gl{V) we let 

sl(V) = {X£ g : tr X = 0). 

Thus since tr is linear and tr[X, Y] = tr XY — tr YX — 0 we see that sl(V) is 
a Lie subalgebra of g; that is, sl(V) is a vector subspace of g so that for all 
X, Ye sl(V) we have [X, Y] = XY- YXesl(V). Also for any Xeg, 

A=-(tr3f)/+ f — - (tr X)1 1 =-(trX)/+ Y, 
n L n J n 

where tr y = 0. Consequently dim sl(V) = n 2 - 1. Next note that exp re¬ 
stricted to s/(V) is actually in SL(V), since we have from exercise (3), Section 
1.1 that 

det(exp X) = e tTX = 1 

$ 

if Xe sl{V ). Thus if we let F = expl^K), we have from the proof of Propo¬ 
sition 1.19 that DF( 0) is the identity. Therefore by the inverse function 
theorem there exists a neighborhood U 0 of 0 in sl(V) and a neighborhood Uj 
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of 7 in SL(V ) such that F: U 0 -*• G,: X -» exp is a diffeomorphism of t/ 0 
onto £/j. Thus for t in a sufficiently small interval ( — d, d) of 0 e R and for X 
fixed in sl{V) we see that the map 

exp : sl(V) -» SL{V) : tX -> exp tX 

maps the line segment tX into a C°°-curve segment in SL(V). 

To coordinatize SL(V) by exp we proceed as follows. First as in the re¬ 
marks following Proposition 1.19 we have the local C°°-diffeomorphism 

log: U, - U 0 

and since U 0 is open in sl(V) we find that ( U { , log) is a chart at I in SL(V ) 
[noting that tr(log exp X) = 0]. Now for any other point A e SL(V) we have 
that 

L(A): SL(V) -> SL(V ) :B->AB 
is a diffeomorphism of SL(V) and therefore the set 

L{A)Uj = AUf = {Auu e U t } 

is an open neighborhood of A, using A ~ AI. Let V=AU T and let 
y = log o L{Ay\ Then (V, j) is a chart at A in SL(V ) as shown in Fig. 2.9. 
Finally we remark that G = GL{V ) is not connected; for if it were, then since 
det is continuous, det(G) = R — {0} is connected, a contradiction. However, 
SL(V ) is connected and this follows from Proposition 2.15 and the following 
result. 



Fig. 2.9. 
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Exercise (4) Let P(V) = {A e GL(V) : det A > 0}. Then P(V) is path 
connected. 

Now to show SL(V) is connected we note that the map 
6 : P(V) - SL(V) : A -► (det A)' 1/n A 
is a continuous surjection so that SL(V) is connected. 

Example (7) Again let V = R n and let 

B : V x B(X , T) 

be a nondegenerate bilinear form (symmetric or skew-symmetric). Then the 
adjoint ,4* relative to £ is uniquely given by 

b(ax ; y)«B(jr,^*y) 

for ^4 e End(F). We have the usual rules 

M + 4- <l4* + bB* and ( AB )* = 5*4*. 

Let 

AT = {£ e End(F): £* = B}. 

Then AT is a vector space and a manifold and the manifold dimension equals 
the vector space dimension. Also for any 4eG = GL(V) let 

/: (/-► End(F): A^AA* - /. 

Then f(A) e K and let 

H ={AeG : B(AX,-AY) = B(X, Y) MX, YeV} 

= {AeG:AA*-I= 0} 

=/ _1 (0). 

Then H is clearly a subgroup of G and H is a closed submanifold of G of 
dimension n 2 — dim K as follows. 

To see this we shall use Proposition 2.13. Thus we must show for every 
A e G such tha.tf(A) = 0 that Df{A) : End(K) —► AT is surjective. For any A e G 
and any Y e K, let X = \ YA*' 1 e End(F). Then we shall show 

r= immx) 

so that Df{A) is surjective. Thus 

[Df(A)](X) = lim - [f(A + tX) -f(A)} 
t-» o t 

- lim -{[(/l + tX)(A + tX )* - /] - ( AA * - /)]} 

(->0 t 

= XA* + (XA*)* = Y. 
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To coordinatize H we proceed as follows. Let g — gl(V) and let 
h={Peg: B(PX, Y) = -B(X , PY) all X, Ye V} 

= {Peg:P*= -P}. 

Then h is a Lie subalgebra of g\ that is, h is a vector subspace of g and for 
P, Qeh we have [P , Q] = P£? — £?P e h. Thus for P, Qe h and a, b e R we 
have 

(aP + 60* = tfP* + 60 = -(aP + 60 
so that h is a subspace and 

[P, 0* - (P0* - (0)* - -iFQ - QP) 

so that [P, Q] e h as desired. 

Now for any P eh we have for all X, Y e V that 

B((exp P)X, (exp P)Y) = B(X, (exp P)*(exp P)Y) 

= B(A", (exp P*)(exp P)Y) 

= B(AT, exp(—P)(exp P) Y) = B(X , Y). 

Thus exp : h H so that as in the preceding example there exist a neighbor¬ 
hood l/ 0 of 0 in h and a neighborhood I/j of / in H such that exp : £/ 0 -* £// 
is an analytic diffeomorphism and (U n log) is a chart at / in H which induces 
the chart ( AUj , log olfi)" 1 ) at v4 in //. Also if for any Peg we demand that 
the C^-curve R : t-+ exp tP actually be in H for t in an interval about 0 
in R , then by differentiating the formula B((exp tP)X , (exp fP)Y) = B(X, Y) 
we obtain P eh using example (1), Section 1.2. Note that the manifold di¬ 
mension of H equals the vector space dimension of h. 

There are various subcases depending on B. 


B Symmetric (1) (i) Let B be positive definite; that is, B{X, X ) = 0 
implies X = 0. Thus there exists a basis e x ,..., e„ of V such that if X = JX e t , 
Y = Y,yi e i> then *(*, = 2 Xy»- In this case H is called the orthogonal 

group and denoted by 0(n). We also note that the vector space K = {B e 
End (V) : B = B*} is just the set of symmetric matrices and has dimension 
n{n + l)/2. Thus the manifold dimension of H = 0{ri) is n 2 — n(n + l)/2 = 
n(n — l)/2. 

Now for A e 0(n) 7 A A* — I yields (det A) 2 = 1 so that det A — ± 1. Thus 


noting 



0 


e Q(n\ 


1 


we have, since det: 0(n ) R — {0} is continuous, that Q(ri) is not connected. 
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Let 


SO(ri) = {A e 0(n) : det A = 1} 

= O(n) n SL(n, R) 

which is called the special orthogonal group. We know that SO(n ) is also a 
manifold of dimension n(n - l)/2 and we shall show later that SO(n ) is con¬ 
nected. In this case the Lie algebra h n s!(V) associated with SO(n ) is denoted 
by so(n). ' 

(ii) Now assume the general form for the nondegenerate form B; that is, 
there exists a basis/j,... ,/„ of V such that for X = £x,f t , Y = $>,/,, then 

B(X,Y)=-t x t y i+ £ Xt y t 

i— 1 i = p+1 

[Jacobson, 1953, Vol. II]. In this case the group H n SL(n, R ) is frequently 
denoted by SO(p, q), where/? + q = n and the Lie algebra h n sl(n) is denoted 
by so(p, q). 

Next we shall consider V — R" as column vectors with 



w 


y } 

x = 


, Y = 



*n_ 


.y«_ 


relative to the basis f t ,...,/„ so that we can write B(X, 7) in block form 

B(x, = JJy = A:‘By, 

where t denotes transpose and I p , I q are the appropriate identity matrices. 
Then for 


P E so(p y q) —{P e sl(n ): B(PX, Y ) = -B(X, PY )} 

we have 


0 « (.PXYB Y + X t B(PY) = X^B + BP) Y. 

Thus P l B + BP = 0 and if we partition P into appropriate blocks 



then P[ 1 - — P n , P f 2 2 = — P 22 , Pi 2 = P* 2 i , an dP 12 arbitrary. Thus we obtain 
the form of the Lie algebra so(p , q) and that it is of dimension 

P(P - l)/2 4- q(q - l)/2 + pq = «(« - l)/2 


using p + q = n. 
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As before we can use (U n log) to coordinatize SO(p,q) where log: 
Uj^UoCi so(p, q ). Thus we see that SO(p , q) is an n(n - l)/2-dimensional 
manifold. 

Exercise (5) Show SO(p,q) is not connected (Hint: Investigate 
matrices of the form 



where A u are orthogonal matrices of the appropriate size satisfying 
det det A 22 = 1)- 

B Skew-symmetric (2) Thus B(X\ Y ) = —B(Y, X ) and there exists a 
basis /i of V such that n = 2p and using the preceding notation 

B(X , Y) has the block form [Jacobson, 1953, Vol. II] 

B(X, Y) = Y = (x k y k+p - x k+p y k ). 

In this case we shall consider H n GL{2p, R) which is freqently denoted by 
Sp(p, R), where n = 2 p, or Sp(p), or Sp(n, R) and is called the symplectic 
group. The Lie algebra associated with Sp(p, R) equals h n gl(2p) and is de¬ 
noted by sp(p, R). Next for P e sp(p, R) we put it into block form and find 



where P 22 = -P' n , P\ 2 =^ 12 . p n =^ 21 , and P n arbitrary. Thus p 2 + 
P(P + l)/2 + p(p + l)/2 = 2p 2 + p = dim sp(p, R) which equals the manifold 
dimension of Sp(p , R ). 

For future reference we present a short list of important Lie groups and 
Lie algebras in Tables 2.1 and 2.2. We will describe the groups and algebras 
entirely in terms of matrices. For convenience we include the groups and 
algebras that have been previously discussed. First define I Pt q e GL(p + q , R ) 
and J n e GL(2n, R ) by 



In the definition of the unitary group U(n) the matrix X = (a fi ) is the com¬ 
plex conjugate matrix of X — 



4. TANGENTS AND COTANGENTS 


61 


TABLE 2.1 
Lie Groups 


GL(n , C) 

nonsingular « x « complex matrices, 

GL{n, R ) 

nonsingular n x n real matrices. 

SL{n , C) 

{XeGL{n, C) : det(Af) = 1}, 

SL{n, R) 

{Xe GL{n, R) : det(X) = 1}, 

0(n, R) 

{XeGL{n,R): X' = X~'}, 

SO(n , J?) 

0(«, R) n SL(n , R), 

0( P ,g) 

{Xe GL{p + q, R) : / p> , A"/," J= AT -1 }, 

SO(p,q) 

£>(/>, 9 ) n SX(/> + q, R), 

Spin, C) 

{Xe GL{2n, C ): X'J„~ l = X-'}, 

Spin, R) 

Spin, C) n GLin, R), 

U(n ) 

{XeGL(n, C) : X'= X' 1 }, 

SU(n) 

U{ri) n SL(n, C ). 


TABLE 2.2 
Lie Algebras 


glin, C) 

/i x /z complex matrices, 

glin , /{) 

n x « real matrices, 

i/(«, C) 

{Xeglin, C) : tr(AT) = 0}, 

<s/(«, /?) 

sl(n, C ) n gl(n , i?), 

so(rt , 7?) 

{X e gl{n, R) : X‘= -X), 

?) 

{X e gl{p + q,R): /„, , X‘I~ J = - X), 

5p(rt, C) 

{Xegl(2n, C) : J„ X‘J n ~ l = —X], 


spin, C) n gl(2n, R), 

w(«) 

{X e glin, C) : X'^ ~X), 

su(n ) 

«(«) n slip, C). 


For more details on matrix groups the reader should consider the work of 
Chevalley [1946, Chapter 1] and Helgason [1962, p. 339], 


4. Tangents and Cotangents 


Let M a R 3 be a well-behaved surface given by the differentiable function 
z =f(x, y ) and going through the point p = (0, 0, 0). Then from calculus the 
tangent plane to M at p is given by the equation 

2 = x <3/(0, 0)/dx + y 5/(0, 0 )/dy for x,yeR. 
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If this plane is cut by the plane x = 0, then the equation of the line of inter¬ 
section is z = y df( 0, 0 )/8y and we obtain the vector (0, 1, 8f( 0, 0)/oy) in the 
tangent plane. Similarly (1, 0, df(0, 0)/dx) is in the tangent plane. These two 
vectors which give the tangent plane are determined by the partial differentia¬ 
tion off Thus we are led to study operators on real-valued functions which 
have the properties of differentiation and we now abstract this situation to 
manifolds. 

First recall that for a C°°-manifold M and for p e M the set F(p) = C°°(p) 
of C“-functions at p e M is an associative algebra using the pointwise opera¬ 
tions: Let U, V be open sets of M containing p and let/: U-* R,g : VR 
be in F(p). Then define for a,beR 

af +bg : C/n V^>R:q^> af{q) + bg{q) and fg \ U n V^R.q -*f(q)g(q). 

Definition 2.16 A tangent at p e M is a mapping L : F(p) -*• R such that 
for all /, g e F(p) and a,beR, 

(a) L(af+bg)=aL(f) + bL(g)- 

(b) L(fg) = L(f)g(p) +f(p)L(g). 

That is, L is a derivation of F(p) into R. Let T p (M), or T(M, p). or M p denote 
the set of tangents atpeM. 

Example (1) For p e M = R m and for fixed X e R m the map 
Lx : F(p) —► R :/-> [Df(p)](X) 

is a tangent at p using Proposition 1.3 concerning the product rule. 

Proposition 2.17 Let M be an m-dimensional C°°-manifold and let p e M. 

(a) Iff, g e F(p) and f(q) = g(q) for all q in a neighborhood U of p, then 
L(f) = L(g) for all L e T(M, p). 

(b) T(M, p) is a vector space over R. 

Proof (a) The function k defined on U by k(q) = 1 for all q e U is in 
F(p) and we have for any L e T(M, p) that 

L{k) =L(k 2 ), using 1 = l 2 
= L(k)k(p) + k(p)L(k) 

= 2 L(k). 

Thus L(k) = 0. Now we have f = kf — kg on U and therefore 
L(f) = L(k)f(p) + k(p)L(f) 

= L(kf)=L(kg) 

= L(k)g(p) + k(p)L(g) = L(g). 
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(Can the “bump function” of exercise 6, Section 1.4 be used above?) 

(b) For L x and L 2 in T(M,p) and for a,beR we see that aL x + bL 2 is 
a linear operator; that is, it satisfies (a) of the definition. Also for f,ge F(p) 
we have 

iaLy + bL 2 ){fg) = aL x (fg) + bL 2 (fg) 

= a(L x (f)g(p) +f(p)L x (g)) + b(L 2 (f)g(p) +f(p)L 2 {g)) 

= (aL x + bL 2 )(f)g(p) +f(p)(aL x + bL 2 )(g). 

Thus aL x + bL 2 e T(M, p). 

We shall now show that the vector space dimension of T(M, p)ism; that 
is, equal to the manifold dimension of M. We shall do this by taking a chart 
( U , x) of M at p such that for u t : x(U ) —► £ where the w* are coordinate func¬ 
tions of R m , the partial derivative operators Z> f (*(p)) = djdu^xip)) in R m for 
/ — 1, ..., m eventually yield a basis £,(» for / = 1, ..., m of T(M, /?). 

Thus let (U 9 x) be a fixed chart at p in M and let/e F(/?), where/is defined 
on an open neighborhood V of p with /: V-> R of class C 00 . Now /is of class 
C 00 on the neighborhood t/nFct/so that we can write/in terms of the 
fixed coordinates x = (x 1 ,..., x m ) where — u t o x. Therefore for D — x(U) 
an open set in R m the function g =/o x" 1 : D -► K is C 00 on £). Thus /= 

g o x = g(x t ,..., x m ) where we write # = .tf J on Z). We now define 

the maps 

: F(p) - F(p):/-» c(/o x" 1 )/^,«. x 

which are called coordinate vector fields relative to (U, x); that is, we form the 
real-valued C°°-function h = d{f ° x~ 1 )/du i = dg/dUi defined on D to obtain 
the function ho x which is in F(p). Sometimes the notations 

d t = 8/dXi and dj = 8fldx t 

are used. The mapping d t has the following properties for / g e F(p) and 
a, b e R, 

(1) d i (af+bg)=ad i f+bd i g ; 

(2) d i (fg) = (d i f)g+f(d i g). 

Note that for the coordinate functions Xj = Uj o x on U we have from the 
above definition 

d { Xj = duj!du i = Sy 

and for the constant function f(q) — c for q e U, we have d f / = 0. 

Next we define an element di(p)e T(M,p) as follows: For f e F(p) we 
obtain d { f e F(p ), then evaluate (^/)(p) e K. Thus 

(SJ)(p ) 
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and from (1), (2) above we see d*(p) satisfies the definition of a tangent at p. 

Proposition 2.18 Let ( U , x ) be a fixed chart at p e M where x = (x t , ..., 
x m ). Then the vector space T(M, p) has basis 


Slip),d m (p) 

and any L e T(M, p) has the unique representation 

L = 

i= 1 

where a t — L(x t ) e R. Thus the manifold dimension of M equals the vector 
space dimension of T(M,p). 

Proof We can assume that x(p) — 0 since a translation x i =y i + t 
yields d/dxi — d/dy Now from Section 1.4 it is easy to see that any real¬ 
valued C^-function g defined on D = x(U) has the Taylor’s formula expan¬ 
sion about the point 0 = (O,...,O)eDcR m 

m 

g = g(Q)+ Iu i9i , 

where u t are coordinates on R m and g t are C°° at 6 e D. Thus for the real 
valued function / = g ° xs F(p) as previously discussed we obtain on U 

f=g° x = g(Q) + Z(uj o x)(gj o x) 

= g(d) + 'Zx j fj, 

where fj—gj ° x e F(p). We apply d t (p) to this equation 

Siip)f= 

= 0 + E di(Xjfj)(p) 

= Z [(SiXj)(p)fj(p) + Xj(p) difjip )] =fi(p) 
using d i x J = S u and Xj(p) = 0. Next we apply L to the same equation 

L(f) = L(g(0)) + Z L(Xjfj) 

= 0 + Z [(Lxj)fj(p) + Xj(p)Lifj)} = Z dj Sj(p)f 

using the preceding equation. Thus Z, = £ a j dj(p)- The elements 8 X (p ),..., 
d m (p) in T(M, p) are linearly independent. For if £ a s d/p) = 0, then applying 
to coordinate functions, 

0 = 0(X;) = Z ajdj(p)(xi) = di 

using dj(p)iXi) = SjXiip) = d tj . 
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Remark Let (£/, x ) and (V, y) be charts at pe M. Then on V n V we 
have the coordinate functions x t and y } defined. Then 8/dx t (p) and d/dy/p) = L 
are in T(M,p) and according to Proposition 2.18 we can represent L by 

m 

d/dyjip) = z dxjdy/p) d/dx^p), 

i — 1 

where the matrix (( dxi/dy k )(p )) is the nonsingular Jacobian matrix obtain by 
writing x t = XjOi,..., y m ). Thus we have the matrix for the change of basis 
in T(M, p) when we change charts at p e M. 

Examples (2) For p e M = R m and X e R m , let L x eT(M,p) be the 
tangent given in example (1) of this section. Then the map R m -> T(M,p) : 
X^L x is linear because L aX+br (f) = Df(p)(aX +bY)= ( aL x + bL Y )(f). 
Also this map is an isomorphism. (Why?) Thus at each point p e M = R m we 
can attach the tangent space which is isomorphic to M itself. 

(3) Let A be a group with identity e and let (x, y) = xyx _1 y~ x be in N 
and for A, B subsets of N let (A, B) be the subgroup generated by all com¬ 
mutators (x, y) with xeA, ye B. Then for A, = (N, JV), N k+i =(N k , N ) we 
have 

N 3 N k => ■ • • ^ N k => 

and call N nilpotent if there exists k with N k ={e}. Now let N be the sub¬ 
manifold of SL(V) consisting of the nilpotent subgroup given by the set of 
triangular matrices 

'1 *' 

1 

_0 1 

where * denotes arbitrary elements from R. We shall now show that the vector 
space Tj(N) is isomorphic to the vector space of all triangular (nilpotent) 
matrices 

'0 *' 

0 

_0 ' 0. 

and denote this vector space of matrices by n. To show the isomorphism we 
shall use the exp mapping by showing exp : n -■* N is locally invertible. Now 
for A e n we see that the associative products A 2 , A 3 ,A k are all in n and 
since A is a nilpotent matrix A m = 0. Thus we see that 

exp,4 =/+ A + ••• + A m ~ 1 l(n i - 1)! 
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is in N so that exp : n^> N. Also D exp(O) is invertible. Thus as before, there 
exist a neighborhood U 0 of 0 in n and a neighborhood of Uj of / in N so that 
( Uj , log) is a chart in A at /. Consequently we have, since U 0 is open in n, 

dim Uj = dim U 0 = dim n. 

However dim Tj(JV) = dim Uj and since the vector spaces T f (N) and n have 
the same dimension, they are isomorphic. 

Exercises (1) Show that n is a nilpotent Lie algebra. Thus first show 
[/i, «] = QT [A it Bi]: A i9 B ( e n) a n. Next define 

n x = [«, n] and n k+1 — [n k , n] 

and note that n =3 n 1 =>•*•:=> n k * * *. So finally show n p = 0 for some p. This 
will show that the nilpotent (Lie) group N is such that the tangent space 
Tj(N) is vector space isomorphic to a nilpotent Lie algebra n. 

(2) Let M , N be C°°-manifolds and let p e M, q e TV. Then show 

T(M x jV, (p, g)) £ T(M, p) x T(JV, g) £ T p (M) © T q (N). 

Recall that if V is an m-dimensional vector space over R, then its dual 
space V* = Hom(K, R). Elements of V* are called linear functionals and the 
map 

Vx V*-+R:(X,f) ^f(X) 
is bilinear and is frequently written 

f(X ) = <X 9 f>. 

Now for any basis X x ,..., X m of V we have the dual basis f 1 * 3 ... ,/ m * given 
by 

OT = <*«.//> = 

From this we see any JeF can be written in the form 

x=Y d f i *(X)x i . 

Definition 2.19 The cotangent space at p g M is the dual space of T P (M) 
and is denoted by or T*(M,p), or M p *. The elements of T*(M,p) 

are frequently called differentials at p and T*(M, p) is also called the space of 
differentials at p. 

Now let /g F(p ) and define the element df e T*(M, p) by 


df:T(M,p)^R:L^L(f); 
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that is, df(L) = <L, df} = L(f). Sometimes a more specific notation df(p) or 
df p will be used. In particular if ( U, x) is a chart with x t the coordinate func¬ 
tions, then a basis for T{M, p) is given by d t (p),8 m (p) and a dual basis for 
T*(M,p) is given by dx^p),dx m (p) because they satisfy 

(8 t (p), dXj(p)y = d(Xj)/dXi(p) = S,j. 

Now for any L e T(M, p) and any f e F(p) we have from Proposition 2.18 
that L = Y, L(x j) d t (p) and therefore 

df(L)=Hf) = ^L(x i )(d i f)(p) 

= I {d t f)(p)L( Xi ) 

= Z ( 8if)(p ) dXiiL); 

that is, 

df(p) = Z dx t (p). 

Combining various facts we have the following result. 

Proposition 2.20 Let M be an w-dimensional manifold and let /i,..., 
f r e F(p) for peM. 

(a) Each feF(p) equals g(f x ,... ,/ r ) on a suitable neighborhood 
V = V(f) of p, where g : R r -* R is of class C°° if and only if df\(p ),..., df r (p) 
generate the cotangent space T*(M, p). 

(b) The functions/j, (that is, r = m) are the coordinates of some 

chart ( U,f ) at p where/= (/j,... ,f m ) if and only if the set df^p),dfjp) 
is a basis of T*(M, p). 

Proof (a) Let (£/, x) be a chart at p and suppose each /e F(p) equals 
g(J \on V n U. Then each of the coordinate functions 

= 9if \»• • ■ ,fr) 

and therefore dx i = ^ <\gj(p) df k . However, since the dx’s generate T*(M, p), 
the df’s also generate T*(M,p). Conversely, assume the df’s generate T*(M,p ) 
and represent f t in coordinates 

fi = f*i(x ! ,...,x m ). 

Then we obtain 

dfi = Y. d khi(p)dx k , 

i = 1,..., r. Now since the df’s generate T*(M, p) the m x r matrix (d*A f (p)) 
has rank m <r. Thus we can assume that there exists a system of m functions 

fij = hij(x j,..., x m ) for j=\,...,m 










68 


2 . MANIFOLDS 


which define a function F : R m R m : x = (x t ,..., x m ) (/q(x), ..., F m (x)) 
where Fj(x) = /}/*). Also £>//*(/?)) is invertible so that by the inverse function 
theorem we can write locally 

x t = k i(A . • • • JO* 

where k i :R m -+R are C 00 . However, each f e F(p) equals , ..., x m ) 
locally, where G is C 00 , and using the above expression for x's in terms of ff s 
we have the results. 

To show (b) just note that for r = m we have df x> ..., df m generate T P *(M) 
if and only if they form a basis. Then we can use the above equations expres¬ 
sing x t = , ... ,/J and/} = hj(x x . xjtosee/i, ... ,/ m are coordin¬ 

ates for some chart at p e M. 

Exercise (3) Let G be open in ,R m and let /: U-> R be of class C®. 
Compare /)/(p) and df(p) for pe U. 


5. Tangent Maps (Differentials) 

In the preceding section we considered a C°°-map g from the manifold M 
into the manifold R and noted that the differential df(p) is a linear map from 
the tangent space T(M,p) into the vector space R ^ T(R,f(p )); this isomor¬ 
phism uses example (2) of Section 2.4. We shall generalize this situation by 
showing that a C°°-map f:M^N between two manifolds induces a linear 
map df(p ): T(M, p) -► T(JV,/(p)). However, by means of coordinate functions 
this generalized situation reduces to that of the preceding section. 

Definition 2.21 Let M and N be C°°-manifolds and let /: M -► JV be a 
C°°-mapping. The differential of / at p e M is the map 

df(p):T(M,p)-*T(N,f(p)) 

given as follows. For L e T(M,p) and for g e F(/(p)), we define the action of 
df(p)(L) on g by 

[df(p)(L)](g)=L(gofy 

Remarks (1) We shall frequently use the less specific notation df for 
df(p) when there should be no confusion. Also we shall use the notation 

Tf= Tf(p ) - df(p) 

and also call Tf(p ) the tangent map of f at p. This notation is very useful in dis¬ 
cussing certain functors on categories involving manifolds. 





5 . TANGENT MAPS (DIFFERENTIALS) 


69 


(2) We note that for g e F(f(p)) the function g °/is in F(p) so the opera¬ 
tion L(g of) is defined. We must next show Tf(L) is actually in T(N,f(p) ) by 
showing it is a derivation. Thus for g, he C°°(/(p)), 

Tf(L)(ag + bh) = L(a(g of) + b{h of)) 

= aL(g of) + bL{h of) 

= a[Tf(L)](g) + b[Tf(L)](h) 
and the product rule is also easy. 

The following result shows that df{p) is the correct generalization for 
Df{p) of Section 1.2, where/: U -* W is a C°°-map of an open set U in R m and 
W is some Euclidean space. 

Proposition 2.22 Let/: M-> JV be a C^-map of C°°-manifolds and let 
p e M. Then the map 

Tf(p) : T(M, p) T(NJ(pj) 

is a linear transformation; that is, Tf(p) e Horn (T(M,p), T(N,f(p))). Further¬ 
more if ([/, x) is a chart at p and ( V, y) is a chart at f(p), then Tf(p) has a matrix 
which is the Jacobian matrix of /represented in these coordinates. 

Proof Let X, Y e T(M, p). Then for a, b e R and g e /•'(/( p)) we have 
[Tf(aX + bY))(g) = (aX + bY){g of) 

= aX{g of) + bY(g of) 

= [a Tf{X) + b Tf(Y)](g) 

so that Tf{aX + bY) = a Tf(X) + b Tf(Y). Next let x = (jCj. xj and 

y — Oh»• • •. y n ) be the given coordinate functions so that we can represent 
/ in terms of coordinates in the neighborhood V by 

fk=yk°f=fk(x for k=\,...,n. 

Now let djdx i = d { (p) and d/dyi = 3,(/(p)) determine a basis for T(M,p) and 
T(N,Ap)), respectively. Thus to determine a matrix for Tf we compute its 
action on the basis d/8x t in T(M, p). Let 

Tf(d/d Xi ) = £ bjid/dyj 

j 

be in T(N,f(p)). Then we evaluate the matrix (b Jt ) using the fact that y k e 
F(f(p)) as follows 

dfjdx,(p) = d i (p)(y k of) 

= [T/(a i (p))](y,) 

= Z b a ftyd/dyj = b ki 

j 

using d(y k )/dyj = 8 kJ . Thus (£> yi ) == ( df/dx^p )) is the desired Jacobian matrix. 
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Proposition 2.23 (Chain rule) Let M, N, and P be C°°-manifolds and let 
f:M^N and g:N-*P 
be C°°-maps. Then for p e M, 

T(g of)(p) = Tg(f(p )) • Tf(p) 

which is a composition of homomorphisms of tangent spaces. 

Proof Let X e T(M, p) and h e F((g o f)(p)). Then using definitions and 
df(p)X e T(N, f(p)) we have 

[T(g°f)(p)Xm = X((hog)of) 

= [df(p)X](h°g) 

= [dg(f{p))(df{p))X](h) 

= [Tg(f(p)) ■ Tf(p)X](h). 

Remarks (3) If U is open in a C°°-manifold M, then U is a C°°-sub- 
manifold such that the inclusion map i : £/ -► M : x x is C®. Also for we U, 
Ti(u ) : r(£/, u)^T(M, u) is an isomorphism and we identify these tangent 
spaces by this isomorphism. 

Many of the preceding results on submanifolds can be easily expressed in 
terms of tangent maps and are usually taken as definitions. Thus let M and 
N be C^-manifolds of dimension m and n, respectively, and let 

f:M^N 

be a C°°-map. Then we have the following results. 

The inverse function theorem can be stated as follows: If p e M is such that 

Tf(p):T(M,p)^T(N,f(p)) 

is an isomorphism, then m = n and / is a local diffeomorphism. Thus there 
exists a neighborhood U of p in M such that 

(1) f(U) is open in N; 

(2) f(U) is injective; 

(3) the inverse map / _1 :f(U) -> U is C®. 

We now consider separately the injective and surjective parts of the above 
homomorphism Tf(p ); this was discussed in Section 2.3. 

We have/is an immersion if and only if Tf(p) is injective for all pe M. In 
case/is injective,/is an embedding. Also /(M) is a submanifold of A if /is an 
embedding and if /(M) has a C°°-structure such that/: M f(M) is a diffeo¬ 
morphism. Thus from preceding results we have that the following are 
equivalent: 
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(1) Tf(p ) is injective; 

(2) there exists a chart ( U , x) at p in M and a chart ( V , y) at/(p) in A 
such that m <n and x f = y* °/for / = 1,..., m and y i °/= 0 fory — ra + 1, 

• • • j 

(3) there exists a neighborhood t7 of p in M and a neighborhood V of 
f(p) in A and there exists a C°°-map g : V-> U such that f(U) c= V and g of 
is the identity | U. 

A C°°-function f:M-+N satisfying (1) at peM is called regular at p. 
If/is regular at every p e Af, then it is also called a regular function. 

We have / is a submersion if and only if Tf(p ) is surjective for all p e M. 
Also the following are equivalent: 

(1) Tf(p ) is surjective; 

(2) there exists a chart (U, x) at p in M and a chart ( V , y) at/(p) in A 
such that m>n and x t = y f o/ for i = 1,..., n ; 

(3) there exists a neighborhood U of p in M and a neighborhood V of 
f(p) in N and a C°°-map g : V U such that f(U) => V and/° ^ is the iden¬ 
tity | K 

Using the surjective nature of 7/(p) we reformulate Proposition 2.13 and 
construct submanifolds using the following version of the implicit function 
theorem: Let/: M -» N be a C°°-map of C°°-manifoIds and let m = dim M ;> 
dim N = n. Let # e/(Af) be a fixed element and let 

f~\q) = {peM :f(p) = q}. 

If for each p e/ -1 (g) we have Tf(p): T(M,p ) - >T(N,f(p )) is surjective, then 
f-'(q) has a manifold structure for which the inclusion map i: 
is C 00 . Thus f~ 1 (q) is a submanifold of M. Furthermore the underlying 
topology of the submanifold f 1 (q) is the relative topology and the dimension 
of f~ 1 (q) is m — n. 

Examples (1) We next consider the special case of f:M^N where 
M = R or N = R. First let N = R ; that is, fe C X (M). Then combining the 
notation of Sections 2.4 and 2.5 we have Tfip) = df(p) and for X e T(M,p) 
we have Tf(p)(X) e T(R,f(p)) = R. Thus for u: R -*• R : t -* t the coordinate 
function on the manifold R, we have for some ae R 

Tf(p)X = a(d/du) 

and as before a = a(d(u)/du ) = [Tf(p)X](u) = X(u of) = X(f), using u(t) = t. 
Consequently the map 

T(R,f(p)) -* R : a(d/du) -*■ a 
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is the isomorphism which yields 

Tf(p ): T(M,p) -* R ; 

that is, which yields the cotangent space. 

(2) Next consider formulated in terms of curves. Thus let 

I = (a, b) and let a : /-+ M be a C“-curve which admits an extension a : (a — e, 
b + e) M (Definition 2.14). The tangent vector to a at t e I is denoted by 
A(t) and defined by 

«(0 = [Ta{t)](djdu), 

where u : R -* R is the coordinate function discussed above. 

Now let Xe T(A/,p). Then there exists a curve a where I is an 

interval containing 0 e R such that a(0) = p and d(0) = X, for let (£/, x) be a 
coordinate system at p with x(p) = 0 and find a curve p : R -*• x(U) cr R m with 

fl°) = 0 and $(0) = [T^p)]^); that is, ft a straight line. Then a = x -1 o (} is 
the desired curve 

«(0) = [Tix' 1 o p)(0)](d/du ) 

= Tx'^xip)) ■ >5(0), using the chain rule and x(p ) = 0 
= Tx-\x(p )) • [Tx(p)](X) = Z 
and 

a(0) = x- 1 (m)=p. 

Also for/e F(p) we have 

X(f) = d(0 )(f) 

= [(Ta(0))(d/du)](f) = d/du(0)(fo a). 

Let ((/, x) be a chart on M and let a : (a, b) Uc M be a C"°-curve as 
above. Then for t e (a, A) we can represent 

d(t) = [7d(0M^) = I a* d t (a(0) e T(M, a(l)) 

and evaluate the coefficients a, = <z,(f) using the dual basis of differentials as 
follows, 

= dx^d) 

= notation 

= d(x ( o a )(d/du), chain rule 

= d/du(xi o a), 

where we use the definition of differential of a function applied to a tangent 
(note paragraph following Definition 2.19). Thus as in calculus the tangent 
vector to a curve a is obtained by differentiating its coordinate representation. 
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(3) Consider the special case when M = G =GL(n, R ). We shall con¬ 
struct an explicit vector space isomorphism of g = gl(n, R) onto T^G). Thus 
for any fixed X e g let 

a : R-+G : t -► exp t X. 

Then a = a and define an element e T X (G) by 

x(f) = m(f) 

for any /e F(I). From the preceding example we actually have X e T 2 (G) 
since a(0) = I. Next note that 

X(/) = [(Ta(0)){d/du)](f) 

= d/du(0)(fo a) 

= lim [/(exp tX ) -/(/)]/< 

0 

= [Z>/(/)](X>. (*) 

Now define the mapping 

cp : g ^ T,{G): X^X, 

where q> is well defined and for X, Y e g and a,b e R we use Eq. (*) to obtain, 
for any /e /’(/ ), 

+ bY)(f ) = aX + b Y(f) 

= [Df(l)](aX+bY) 

= aDf(I)(X) + bDf(I)(Y) 

= aX(/) + W) 

= [a<p(X) + MW) 

so that <p is a vector space homomorphism. Next suppose X = 0 and let 
{m=n 2 ) be coordinates in g(^R m ) corresponding to a basis 
X t ,X m of g. Let X = £ X t e g with cp(X) = X = 0 and let f t = 
° log e F(/) as previously discussed. Then/(/) = 0 and since X = 0, 

0-x/i 

= lim L/iCexp tX) -fi(I)]lt, Eq. (.) 

<-*0 

= X, 

so that X = 0 and (p is an isomorphism. We frequently omit this isomorphism 
and just use the most convenient identification for a given problem. 

(4) Let /: G -> G be a C 00 -automorphism of G = GL(V). Then from 
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Section 1.6 we see that the “tangent map” Df(I) is an automorphism of the 
Lie algebra g = gl{V). Thus relations on the Lie group are translated to 
relations on the Lie algebra by the tangent map. 


6. Tangent Bundle 


In this section we shall show how to make the collection of tangent spaces 
of a manifold into a manifold. We also discuss mappings of such manifolds 
and use them to define vector fields in the next section. 

Definition 2.24 Let M be a C°°-manifold of dimension m and let 
T(M) = p): pe M} 

which is a disjoint union. We call T(M) the tangent bundle of M. 

We now make T(M ) into a manifold (Fig. 2.10). We shall frequently 
denote the points of T(M) by the pairs (p, Y) where p e M and Ye T(M,p ); 
thep is unnecessary in this notation but convenient. First T(M) is a Hausdorff 
space as follows. Let 


n : T(M) -> M : (p, Y)-*p 

be the projection map. For ( p , Y) e T(M) let (U, x ) be a chart at p in the atlas 
si of M. Then n~\U) ={(q, X) eT(M) : q eV}. Now if (q, X) e n~\U), 
then in terms of coordinates x(q) = ( Xl (q ),..., xjq)) and X = £ a, 8/dx/q), 
where a t = aj(q). The map 

<t>u • 71 l (U) -► R 2m : (q, X) -► (x t (q),x m (q), a 1 ,...,a m ) 

is injective and there is a unique topology on T(M) such that for all ( U, x) e s4, 
the maps <f> u are homeomorphisms (why?). This topology defined by the sets 
n 1 (U) can easily be seen to be Hausdorff using the fact that M and R m are 
Hausdorff. Also note that since M has a countable basis of neighborhoods, 
then so does T(M). 

Next we define a C°°-atlas on T(M) so that the projection map 
n : T(M) -► M is a C°°-map. Thus for each (p, Y) e T(M) let n~'(U) be a 
neighborhood of (p, y) where (U, x) is a chart at p and let 4>(U) = <f) v : 
Ti l (U) -> R 2m be the above homeomorphism. We claim that (n~ 1 (U), 4>(U )) is 
a chart at ( p, Y). Thus we must show any two such coordinate neighborhoods 
are compatible. Therefore, let (U , x), (V, y) be charts at p where the x and y 
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are C°°-related by x = f(y ) on V n V; that is, /= x °y 1 . Now in terms 
of coordinates let x k =f k (y 1 yj, 

<KV) ~ (Zj , . . . , Z m , Z m + j , » • - , Z 2 m), (^1 9 • • • 9 > ^m + l > • • • * ^2m) 

where 

Zf(q, X) = Xi(q) for 
z J+m (q, X ) = flj for ; = 1 ,..., m, 

and similarly w ; (<jr, X) = y t (q) for / = 1, ...,/m and w j+ m (<y, A) = />, where 
X=Zb i d/dy i (q). 

Now for (< 7 , X) e n 1 (U) n n 1 (V) we have first for 1 = 1,..., m 
z,(q, X) = *;(<?) =/i(yi(g),..., yjq)) =/)(w' 1 (^, Z),..., w m (^, X)) 

so that the first m coordinate functions are C°°-related. Next for j = 1,.,., m 
and for X = J] a f d/dx^q) we note that 

W* X) = fljfo) = *(*,) = dXj(X) 

and similarly ^(X) — w> J+m (#, X) for,/ = 1,, m. Thus by the transforma¬ 
tion law for differentials (note remark following Proposition 2.18), 

2 J+m (q, X) = dxj(X) = £ d Xj /dy k (q) dy k (X) 

k 

= 'Lfjk(yi(q), ■ ■■,y m (q))dy k (X) 

k 

= Y.fjk{ w i(q> X),..., w m {q, X))w m+k (q, X) 

k 

which is a C°°-relationship where f ik = dfi/dy k recalling x t = f£y ±,..., y m )- 
Thus all the coordinates are compatible. The Hausdorff space T(M) with the 
maximal atlas determined by the above charts is a C 00 -manifold and we shall 
always consider the tangent bundle with this C°°-structure. 

Finally we note that the projection map n : T(M) M : (p, Y) -+p is C 00 . 
For let ( U , x) be a chart at p and let = (z t ,..., z 2m ) be a coordinate 
system at (p, Y ) as above. Then in terms of coordinates, x t ° n(q , 2f) — z t {q , X) 
for ? = 1 , ..., m which shows that the coordinate expressions x i ^n = z t are 
C°° (see Fig. 2.10). 


ip 9 X) 


T{M ) 
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We now discuss real vector bundles [Lang, 1962; Loos, 1969]. 

Definition 2.25 A vector bundle £ over a C°°-manifold M is given by the 
following. 

(a) £ is a C^-manifold. 

(b) There is a C°°-surjection n: E->M called the projection map. 

(c) Each fiber E p = n~ i (p) has the structure of a vector space over R. 

(d) £ is locally trivial; that is, there is a fixed integer n so that for each 
peM there exists an open neighborhood U of p such that U x R n is diffeo- 
morphic to tt *([/) by a diffeomorphism <f> so that the accompanying diagram 
is commutative, where pr x is the projection onto the first factor; specifically, 

UxR n — > n~\U) 



U 


n o 4>{q, X) = q. Furthermore we require for each qeU that <p(q, ) is a vector 
space isomorphism of R" onto E q = n~ i (q). 

Examples (1) £ = T(M) the tangent bundle where n = m and 
R m = T(M, p) = E p . 

(2) Let M be a C^-manifold and let 

T*(M)={J{T*(M,p)}:peM}, 

which is called the cotangent bundle. Then analogous to the construction of 
T(M) we make T*(M) into a C°°-manifold. Thus for £ = T*(M ) we see that 
T*(M) is a vector bundle and E p = n~ 1 (p) = T*(M,p). 

Definition 2.26 Let M and N be C°°-manifoIds and let £ and £' be vector 
bundles over M and N, respectively. A bundle homomorphism is a pair of 
(surjective) maps (£,/) such that: 

(a) £:£->£' and are C°°-maps; 

(b) the accompanying diagram is commutative; that is, ri °F=f°n. 



M N 
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Thus for each peM, F(n~ 1 (p)) c (n f y 1 (f(p)). 

(c) For each p e M, the restriction F : E p -+ E' f(p) of the fibers is a linear 
transformation of the corresponding vector space structures. Also ( F,f ) is a 
bundle isomorphism of E onto £' if it is a bundle homomorphism such that 
the maps F and/ are surjective diffeomorphisms. It is easy to see that in this 
case the pair is a bundle isomorphism of E' onto E . 

Examples (3) Let g : M -► N be a C°°-map. Then we define the map 

T(g) : T(M) - T(N) : ( p , X) (<?(p), [T(< 7 )(p)](X)) 

where A"e T(M,p) and therefore [T(^)(p)](X) e T(N, g(p)). Then (Tg, g) is a 
bundle homomorphism of T(M) into T(N) because the diagram 


T(M) -- T(N) 



M g *■ N 

is commutative and T(g)(p) : T(M,p) T(N,g(p)) is a vector space homo¬ 
morphism. 

Next note if we also have another C°°-map h : L^M of manifolds, then 
g o h\ L~* N is a C^-map and 

T(g o h) = T(g) o T(/z): T(L) - T(N), 

so that (T(^ o h), g o /z) is a bundle homomorphism. Thus T can be regarded 
as a covariant functor from the category whose objects are manifolds and 
morphisms are C°°-maps into the category whose objects are vector bundles 
and morphisms are bundle homomorphisms [Loos, 1969]. 

It will be easy to see later that if G is a Lie group, then the tangent bundle 
T(G) is a Lie group and is isomorphic as a vector bundle and as a Lie group 
to the Lie group g x G (semi-direct product) where g is the Lie algebra of G. 
Thus the tangent bundles which we want to consider are of a relatively 
simple nature. 

Exercise (1) Let M and N be C°°-manifolds and let M x N be the 
corresponding product manifold. Show the tangent bundle T(M x N) is 
bundle isomorphic to T(M) x T(N). 
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7. Vector Fields 


We have previously discussed the coordinate vector fields 8 t = d/dxi and 
saw that they were functions defined on a neighborhood U of p e M which 
assigns to each qeU a tangent vector d t (q) e T(M, q). 

Definition 2.27 Let M be a C°°-manifold and let T(M) be the correspon¬ 
ding tangent bundle. A vector field on a subset A a M is a map X: AT(M ) 
such that n o X — idy | A. Thus X assigns to each p e A a tangent vector X(q ), 
where X(q) e T(M, 4 ), but such that p = idy(p) = (n o Z)(p) = 7 r(Z(g)) = q. 
That is, the tangent vector assigned to p by X is actually in T(M, /?). Also Z 
is a C°°-vector field on A if A is open and if for each / e C°°(A) the function 
Xfi s in C°°(A ) where we define Xf by the action of the corresponding tangent 
vector: (Xf)(p) = [X(p)](f). Thus X is C 00 on M if and only if X : M -+ T(M) 
is a C°°-mapping of manifolds. 

Example (1) Let M = R 1 2 and let A = B( 0, r) the open ball of radius r 
and center 0. Then with coordinates u t , u 2 on A a C°°-vector field X on A can 
be written 

X = , u 2 ) d/dUi + ^ 2 ( W 1 J w 2 ) d/du 2 

where the and a 2 are C°°-functions on A as shown below. Thus a C°°-vector 
field is a well-behaved variable tangent vector. 

Remarks We now consider a vector field on M locally in terms of co¬ 
ordinates. Thus let (U 9 x ) be a chart on M with U open in M, then we have 
the following. 

(1) The coordinate vector fields d t = d/dx t are C 00 -vector fields on U. 

This follows from the previous discussion: d i (p)eT(M,p) so that 
(n o 8 t ) ip) =p. Next if fe C°°(C/), then g =/o x ~ x : x(U) -► R is C 00 on the 
open set x(U) <= R m . Also d t {f) = ox is C 00 on C/ where w x , ..., u m 

are coordinates on R m . 

(2) If X is a C^-vector field on (7, then there exist functions a t e C°°((7) 
such that X=Y, a i &i on Furthermore the a t = X(x Thus the functions 
a i : U R exist because for each q e U, the tangents d^q), i = 1,..., m, form 
a basis of T(M, q) and X(q) = £ ^(< 7 ) for some ^( 4 ) e R. The a* are C 00 
since 5*0**) = 5 ki and = X <*k d k (x t ) = X(x t ) which is in C™(U). Also note 
that if X is a C°°-vector field on M, then the restriction X\ U is a C 00 -vector 
field on U and has the above expression in coordinates. Thus summarizing 
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we see that a vector field X is C 00 on M if and only if for every chart ( U , x) the 
corresponding component functions a t — X(Xj) are in C°°(U). 

The following result is frequently taken as a definition [Helgason, 1962]. 

Proposition 2.28 We have that X is a C°°-vector field on M if and only 
if X is a derivation of the algebra C°°(M) into C°°(M). 

Proof For each f,ge C°°(M) and a,b e R the properties 

X(af + bg) = aX(/) + 6 X( g) and X(fg) = (X/> + /(X?) 

follow from the corresponding properties for tangents (Definition 2.16). Also 
by definition X is it and only if Xfe C G 0 (M). 

We have seen that a C°°-vector field X on M restricts to a tangent X(p) 
and we now consider the converse of extending a tangent to a vector field. 


Proposition 2.29 Let M be a C°°-manifold and let Xe T(M,p). Then 
there exists a vector field X which is C 00 on M such that X(p) = X. 


Proof We can choose a chart (£/, x) at p such that X — £ djtp). Thus 
defining the constant functions a t : U ~^R\q-*b i we see that T = X ^ 
is a C 00 -vector field on U such that X = Y(p ). Now let 0 : M -► i* be a C 00 
“bump function” at p\ that is, from exercise (6), Section 1.4 we have 
pe D<^U, where Z) is an open neighborhood of p and the C^-function </> 
satisfies 0 < 0(.x) < 1 for all x 6 M and </>(#) = 1 if q e D and 0(x) 0 if 

xe M — U, Then we define 

<t>Y on l/, 

(0 on M — C/. 

Thus X(p) = 4>(p) Y(p) = X and by construction X is a C°°-vector field on M. 


Example (2) For the manifold G = GL(V) we identified in Section 2.5, 
Tj(G) with g = #/(F) and for X e g we define a C°°-vector field X on G by its 
action on f e C°°(G) at p e G 

(Xf)(p) = X(/o L(p» = [TL(p)(7) * X](/) 

where L(p) :G^G:q^pq. Then X is^C°° since the right side^of the equality 
consists of C^-operations and note ( Xf)(I ) = X(/) so that X(/) — X. Also 
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X(p) [TL(p)(I)]X e T(G, p) so that [7t°X](p)=p which shows J? is a 

vector field. Using (*) of example (3), Section 2.5 we have 

(Xf )(p) = X(fo L(p)) 

= Jim [/°U(p)](exp tX) - [foL(p)}(I) 

1^0 t 

- Yim ^— exp tX ^ zJM 

t-> o t 

= exp tX ) I f=0 • 

We shall let D(M ) denote the set of all C°°-vector fields on M. Then we 
have the following algebraic results concerning these derivations 

Proposition 2.30 (a) D(M) is a Lie algebra over R relative to the 
bracket multiplication [X, 7] = XY - YX. 

(b) D(M) is a left /--module over the ring F = 

Proof (a) Clearly if X, Ye D(M ) and a, b e R, then aX+bYe D(M) 
by just checking the properties of a derivation. Next we shall show 
[X, Y] = XY — YX is a derivation 

[X, Y](fg) = X[( Yf)g +/( Yg)} - Y[{Xf)g + f{Xg)] 

= (XYf)g + ( Yf )(Xg) + (Xf)( Yg) + f(XYg) 

~ ( YXf)g - (Xf)( Yg) - ( Yf)(Xg) -/( YXg) 

= ([X, Y]f)g +f([X, Y]g). 

The multiplication [X, y] is bilinear and satisfies [X, 7] = - [ 7, X], Also the 
Jacobi identity 

[X, [Y, Z]] + [Y, [Z, X]] + [Z, [X, 7]] = 0 

is a straightforward computation which is always satisfied for the bracket of 
endomorphisms. 

(b) It is easy to see that the various defining properties of a left module 
are satisfied; for example, (/+ g)X =fX + gX or (fg)X =f(gX) for f,geF 
and X 6 D(M [). However, note that D(M) is not a “ Lie algebra ” over F since 
for scalars ” fge F we do not obtain the correct action relative to the 
product 


[fX,gY] =fg[X 9 Y] +f(Xg) Y ~ g( Yf)X / fg[X, Y]. 
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Remark If for a chart ( U , x) on M we let X = £ a t 8 { and Y = £ £* 
be in £>(!/), then for any /e C°°(l/), (XY)(f) = ^(1/) e C C0 (L/) and 


(AT)(/) = £ *(** d k (f )) 

k 



db k d 

- T- + bt CL ; 

dx t dx k 



which shows XY is not a tangent vector. However because the order of 
differentiation can be interchanged, the second-order derivatives vanish in 
[X, Y]. 


Example (3) Let u x , u 2 , u 3 be coordinate functions on M = R 3 and let 

X = u 2 d 3 — u 3 8 2 , Y — u 3 d x — u x d 3 , Z = u x d 2 — u 2 d x . 

Then X, Y, Z are linearly independent (over R) C 00 -vector fields and the 
vector space L spanned by X, Y, Z is a Lie algebra because the products 

[X,Y]=-Z, [Y,Z]=-X, [Z,X]=-Y 

are all in L. 


Next we consider the action of a C°°-map f:M->N on vector fields. First 
we note that/induces a map Tf(p ) : T(M,p ) T(N,f(p)) which maps tangent 

vectors into tangent vectors. However, in general, it is not possible to map 
vector fields on M into vector fields on N by Tf, Thus for any X e D(M) 
define the map 

Tf(X): M - T(N) : p - [Tf(p)]X(p) 
noting that [Tf(p)]X(p) e T(N, f(p)). 

One would like to use ( Tf)X to define a vector field over N or even over 
/(M) by taking a point r = f{p) e N and defining [(Tf)X](r) to be [Tf(p)]X(p). 
However, this is not always possible as shown by the following. Suppose 
p^q but f(p)=f(q). Let XeD(M) be a C°°-vector field such that 
Tf(p)X(p) # Tf(q)X(q) both of which are in T(Nj(p)). Then we can not assign 
a unique value to ( Tf)X at r e N by the desired process. 

Exercise (1) If/: MN is a diffeomorphism, then show that a vector 
field can be defined on N by the formula [( Tf)X] : N -*■ T(N ). 

Definition 2.31 Let/: M -> N be a C°°-map and let X e D(M), Y e D(N) 
be vector fields. Then X and Y are ^related if ( Tf)X = Y °f \ that is, for all 
peM, Tf(p)-X(p)= Y(f(p)). 
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Thus if X and Y are/-related, then for every g e C X (N) 

(Yg) of=X(g°f), 

for if p e M, then 

[( Yg) of](p) = ( Yg)(f(p)) = Y{f(p))g 

= l T f(P) ■ X(p)]g = X(p)(g of) = [ X (g o/)](p). 

Definition 2.32 Let/: M -* M be a C°°-map and let X e D(M) be a 
vector field on M. Then X is/invariant if X is /-related to X. Thus X “com¬ 
mutes” with the action of/by means of the formula TfX= X o f- that is 
Tf(p)X(p) = X{f(p)). 

Another way of viewing the/-invariance of X is by noting that Tf(p)X(p) 

and X(f(p)) are both in T(M,f(p)) so that the /-invariance of X means they 
are equal. 

Example (4) __For G = GL(V) and Xegl(V) we defined the vector 
field X on G by (Xg)(p) = X(g ° L(p)) where g e C°°(G). Now for any a e G, 
X is L(a)-invariant. Thus let/ = L(a), then for any g e C X {G) 

[Tf(p)X(p)](g) = X(p)(g of) 

= [X(gof)] (p ) 

= Y((g of) o L(p)). definition of X 
— X(g o L(a) ° L(p)), using/ = L(a) 

= X(g o L(ap)) = (Xg)(ap) = [X(ap)}(g). 

Thus Tf(p)X(p) = X(JXp)) ; that is, TL(a)X = X ° L(a). The vector field X is 
called left invariant or G-invariant and will be used in yet another definition of 
the Lie algebra of G. 

Proposition 2.33 Let X, and Y l , X 2 and Y 2 be/-related. Then \X, XX 
is /-related to [Y lt Y 2 ]. 2 

Proof Since the X s and Y' s are/-related we have using the paragraph 
following Definition 2.31 for any g e C K (N) that 

Y 2 ( T, g)f(p) = X 2 ( Y t g o/)( p ) 

= X 2 (X t (g °f))(p) = [X 2 X,(g of)](p). 

Thus since a similar formula holds for Y l Y 2 we have 

([ Yi, Y 2 ]g)f(p) = ([ Xl , X 2 ](g » f))(p) 

so that [ T|, Y 2 ] is/-related to [X it X 2 \. 
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Proposition 2.34 Let /: M -»N be a C°°-map. 

(a) If / is an immersion, then for every Y e D(N) there is at most one 
X e D(M) such that X and Y are /-related. In this case the X e D(M) exists 
if and only if for every pe M we have Y(f(p)) e Tf(p)T(M,p ). 

(b) If / is a surjection, then for every X e D(M) there is at most one 
y e D(N ) such that X and F are/-related. 

Proof (a) Let Y e D(N) and let X, Z e D(M) with Tf(p)X(p) = 
Y(f(p)) = Tf(p)Z(p). Then since Tf(p) is injective X(p) = Z(p); that is, X — Z. 
Now if A" exists, then by definition F(/(p)) = Tf(p)X(p) e Tf(p)T(M,p) and 
conversely one can define X by X(p) = 7/(p) _1 F(/(p)) and this defines a 
vector field on M. 

All that remains to show is that X is C 00 . Now since Tf{p) is injective we 
have from Section 2.5 that for p e M there is a chart (K, y) at/(p) in N so that 
((7, x ) is a chart at p where = y t of for i = 1,..., m. Now with these co¬ 
ordinates we let X = £ a { on U, then for qeU we have 

flita) = *(*<)(*) = °f)M 

= [(TfX)(yd)(q)=[(TfX)myi) 

= [Y(m)](yi) = [(Y yi )of]( q ) 

which shows a { is C 00 because Yy { and / are C 00 . 

The proof of (b) is a straightforward exercise. 

Exercises (2) Let /: M-^M be a C^-map. Show that the set of /- 
invariant vector fields in D(M) is a Lie subalgebra of D{M). 

(3) Let G = GL{V) and let p be an analytic multiplication on G; that is, 

p : G x G -► G : (x, y) -+ /*(*, t) 

is an analytic mapping of manifolds. Now form the differential 
Tp : T(< 7 , x) x T(G, y) T(G, p(x, y)); 
that is, for X e T(G, x) and Y e T(G, we have 

1(Tm)(x, y)](X, Y ) e T(G, /i(x, >>)). 

(i) For Xe T,(G) = gl(V) show the map 

Kfi, X):G^T(G):x^ [(Tg)(x, /)](0, X) 
is an analytic vector field on G if and only if jx{x, I) = x for all x e G\ 

(ii) Similarly discuss the function 

r(g, X) : G -* T(G): x -> [(7]u)(7, x)](X, 0); 
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(iii) In case /z is the usual associative multiplication on G , compare the 
vector fields /(/z, X ), r(/z, A), and J? of example (2). 

(4) What can be said about a function fe C°°(R m ) such that [fX, Y] = 
f[X, Y] for all C°°-vector fields X, T e £>(fr) (note Proposition 2.30)? 


8. Integral Curves 


Let a be a C 00 -curve defined on ( a , 6) into M as discussed in Section 2.5. 
Then the tangent vector a (t) is given by d(t) = [Toc(d/du)](t) e T(M, a (t)). 
Thus a . (a, b) T(M) is a C°°-map such that the accompanying diagram is 
commutative. 



Definition 2.35 Let M be an m-dimensional C^-manifoId and let X be 
a C°°-vector field on M. An integral curve of X is a C 00 -curve oc: (a, b) -> M 
such that the tangent vector to a at each t e (a, b) equals the value of X at 
a(f); that is, a (t) = X(cc(t)) all t e (a, b). Thus the accompanying diagram is 
commutative. 


(a, b) — * > J(Af) 



x 

M 


In terms of a chart (U, x) of M we have from Section 2.5, 
a = £ d(x t o a)/df d ( . 
and writing in coordinates on U 
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we obtain a as an integral curve of X if and only if 

d(x t © ct)/dt = a i (x 1 o a, ..., x m ° a) for i = 1 ,..., m. 

We now summarize the facts we shall need concerning the solutions of 
such differential equations and the proofs can be found in the work of 
Dieudonne [1960] and Lang [1968]. 


Proposition 2.36 Let U be an open subset of R m , let p e U, and let 
a { e C°°(C/) for i = 1,..., m. Then 

(a) there exists an open neighborhood D of p with D c= U; 

(b) there exists an open interval ( — e, a) a R ; 

(c) there exists a C°°-map/:( — £, e) x D -* U : (t, w) w) such that 
for each w e D the function a w : ( — e, e) -> U : t -»/(*, w) with a f = u { ° a w for 
/ = 1 , ..., m satisfy 

(i) dctjduffy = a i (a 1 (t)> • •, a*( 0 ) all * € (- e, e), and 

(ii) a f (0) = w t where w t = u^w). 

Moreover if a w : (- e, e) -► U with (- e, s) c (- e 9 e) satisfies (i) and (ii), then 
= «wl(-e, e). 


Thus the unique solutions to the above differential equations depend in a 
C^-manner on the initial conditions. We now translate these facts to mani¬ 
folds [Bishop and Goldberg, 1968; Singer and Thorpe, 1967]. 


Theorem 2.37 (a) Let M be a C°°-manifold and X e D(M) a C°°-vector 
field on M and let peM. Then there exists an open neighborhood D of p 
in M and an open interval ( — £, a) <= R and a C°°-map/: ( —e, e) x D -> M 
such that for each we D the curve 

a w : ( —e, e)->Af : *-►/(*, w) 

is the unique local integral curve of X defined on (— e, e) with ot w (0) = w . In 
particular oc p is a local integral curve through peM. 

(b) For each t e (- e, e) the C°°-map 0(0 given by 0(0: D -> M : w 
f(t , w) satisfies: 

(i) if s,t and s+t are in (-£,£>, then 00 + 0 = 00 ) ° 0(0 on 
(KtV'WnD; 

(ii) if t 6 (— e, e), then exists on Z> n 0(O(£) am * <K0 1 = 0- 
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A map 4> —e 9 e.) x Z)-► M such that 4>(t) satisfies (i) and (ii) above is 

called a local one-parameter group on M. 

Proof (a) Let ( U , x) be a chart at p in M and let U' — x(U) c R m . 
Then on U' we have X = £ a t d* where ^ e C°°(£/'). By Proposition 2.36, 
there exist D' c= U' and ( — 8, e) a R and /': (—e, s) x D'^U' with the 
desired properties which can be translated back toMby x -1 . 

(b) We use the uniqueness part of Proposition 2.36 as follows. For fixed 
te(-e, e) the curves u(s) =f(s 4-t, w) and v(s) = f(s , </>( 0 (wO) are integral 
curves of X defined on a subinterval of ( —e, e) which contains 0 and by the 
initial conditions we have w(0) = i?(0) = /(t, w). Thus by the uniqueness 
u — v\ that is, <p(s + t) = 4>(s) o Also </>(0 -1 = <£( — *)• 

The preceding results on differential equations are also true when C 00 is 
replaced by “analytic.” Furthermore, if the vector field depends analytically 
upon a parameter, then the integral curve does also as follows. 

Definition 2.38 Let M be an analytic manifold and let V be a Euclidean 
vector space over R. Let A denote an element in V and let X(A) be an analytic 
vector field which is a function of A e V, Then X{A) depends analytically on 
the parameter A e V if for any pe M and any function / analytic at p, the 
mapping Dom(/) x V -+ R : (q 9 A) -> [X(A)(f)](q) is analytic. 

Using the results of Dieudonne [1960] and Lang [1968] on this dependence 
we have the following. 

Theorem 2.39 Let M be an analytic manifold, V a Euclidean vector space 
over R , and X(A) an analytic vector field which depends analytically upon 
the parameter A e V. Then for any pe M, there exist an open interval (- e, e) 
c R and an open convex neighborhood U of 0 in V and an analytic map 
u : ( — s, e) x U -> M : (t. A) u{t , A) which is the unique local integral curve 
of X(A) through p e M. 

Proof Since this is a local result, we can assume M is an open set in 
R m so that the vector field X(A) can be represented by analytic functions 

<*i(x,A) on M x V for / = 1.that is, X(A) = f j a i d where 

a i' M x K-> R are analytic. Thus we now have as before a system of (para¬ 
meterized) differential equations for the integral curve, and the results follow 
from Dieudonne [1960, Theorem 10.7.5], for example. 

Exercise ( 1 ) Show that the vector fields X, /(p, X), and r(p, X) in 
exercise (3), Section 2.7 depend analytically on the parameter Xegl(V). 
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Example (1) Let p = (0, 0) e R 2 — M with coordinates ( u x , u 2 ) and 
let X = d x 4- exp( —m 2 ) d 2 be a vector field on all of M. Then the equation for 
the integral curve a is 

dccjdt = 1 and dajdt — exp( —a 2 ). 

Let D = {(x, y) e R 2 : — 1 < y < 1} be an open neighborhood of p and let 
e = e~ 1 . Then for t e( — e, e) and for w — (w x , w 2 ) e D , the C°°-map 

/: (— e, e) x D -► M : (/, w) -► (t 4- , log(t 4- exp w 2 )) 

is such that 

«w(0 = MO, a 2 (0) 

with 

a i(0 = t + and a 2 (t) = log(t + exp w 2 ) 

is a solution to the above equation with a w (0) = w. 

Remark Theorem 2.37 gives only local existence and uniqueness of 
integral curves and it is not always possible to find global curves; that is, it is 
not always possible to extend the domain (—6, e) to all of R. Thus, for ex¬ 
ample, let M = R 2 — {(0, 0)} with coordinates (u x , u 2 ) and let X = d x . Then 
the integral curve oc(t) of X through (1, 0) is a (t) = (t + 1,0) which cannot be 
extended to a curve in M defined on all of R because (0, 0) is not in M. 

Let M = R 2 and let X = —u 2 d l +u i d 2 be a vector field on M. Then the 
general form for the integral curve a w (t) is 

a w(0 = (w>i cos t — w 2 sin t , w 2 cos t -h w x sin t ) 
and a w (0) = w. Note that <x w (t ) is defined for all t e R. 

Definition 2.40 A vector field is complete if all its integral curves have 
domains all of R. 

Exercise (2) Show X = — u 2 d x 4- u x d 2 is complete on R 2 . Is X = 
exp ( — Mi) d x + d 2 complete on R 2 ? 

Examples (2) Let G = GL{V) and let g — gl(V) be identified with 
Tj(G). For X e g we have defined the G-invariant vector field X by ( Xf)(a ) = 
X(f o L(a)) for all a e G and /e C°°(G). Let E u be the usual matrix basis of 
End(F) which gives coordinate functions u xj on G ; that is, M fj (a) = (a^-). We 
write X — £ X { j d/du tj so that X {j = X{u i3 ) are in C°°(G) and we now compute 
the coordinate functions X i} . 

For a, x e G we have 

m 

(Uij o L(a))(x) = Uijiax) = £ u ik {a)u kj (x), 
fc=l 
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using matrix multiplication. Thus applying X to this formula for u i} o L(a) we 
have 

X( UiJ °L(a)) = X(Z u ik (a)u kJ ) 

k 

= Z Uik(a)X(u kj ) = £ u ik (d)x kJ , 

k k 

where we write X = £ x y d/du,^I)e g and have x p4 = X(u pq ) = X pq (I). Thus 
letting / = tty in the definition of X we obtain 

X tj (a) = (Xu^ia) 

= Jf(« y o L(aj) = Z u ik (a)x kj , (*) 

k 

so that the equation for an integral curve a of X is 

d(Uij ° ct)/ds = Z (Ui k ° ci)x k j for i = l, m. 

k 

From example (2) in Section 2.7 on X we have 

(Xf)(p) = df(p exp tX)/dt\, =0 
so that for p = q exp sX we have 

(Xf )(q exp sX) = df[q • exp(,y + t)X]/dt \ l=0 

= df(q • exp uX)/du | H=s = df(q ■ exp sX)/ds, 

where associativity is used in the first equality. Thus for q = I and / = u tj we 
have 

du;j(ex p sX)jds = (Zw y )(exp sX) = £ w it (exp sX)x k] 

k 

using (*) above for the last equality. This shows that oc(j) = exp sX is the 
solution of the equation for the integral curve of X 

dtXjj/ds = Z a ik x kj and a(0) = I, 

k 

where a y = « y o a. In terms of the given matrix X = (X tJ ) this equation can 
be written: da/ds = aX which yields a(j) = exp sX which is a one-parameter 
group defined on all of R. If the initial condition is changed to o<(0) = A, then 
for X we obtain the integral curve a A (j) = A • exp sX and ^(0) = A. From 
this we see X is a complete vector field on G. 

(3) We now consider a Taylor’s series expansion for a real-valued an¬ 
alytic function/on the analytic manifold G = GL(V). Thus let Xeg and X 
be as in the preceding example and let /be analytic at p e G. Then from this 
example we have 

df(p exp sX)/ds = (Xf)(p exp sX) = [X{p exp iX)](/) 
and by induction 

d n f(P exp sX)/ds n = [X(p exp iX)](Z' ,_1 (/)) = (X"f )(p exp sX). 
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Thus if we write g(s ) = f(p exp sX ), we have, since g is the composition of 
analytic functions, the power series in a suitable interval containing 0 e R 

„ £JL 

where the a n e R are computed by differentiation as usual 
a n = d*f(p exp | sas0 = (■ X*f)(p ). 

Thus if we define the operator formula 

[(exp L?)(/)](p) = £ 

n = 0 W! 

we obtain the following version of Taylor’s formula for GL(V) 
f(p exp sX) = [(exp $£)(/)](?). 

Exercise (3) (i) Consider the C^-vector field on R 3 defined by 

X(p) = p 2 (d/dxjip) + p 3 (8/8x 2 )(p) + pi (8/8x 3 )(p) 

where p = (pi ,p 2 ,p 3 )- Find the integral curve a(t) of X so that a(0) = 

(-1,1,1). 

(ii) Let the C°°-vector field on R 3 be given by Y(p) = pip 2 ( d/dx 3 )(p ). 
Compute [X, Y](p). 



CHAPTER 3 

TOPOLOGICAL GROUPS 


In our previous discussion of some matrix groups it was observed that we 
were studying not only the group operations but also the continuity of these 
operations. Thus in this chapter we abstract the situation and consider groups 
which are topological spaces so that the group operations are continuous 
relative to the topology of the space. We then prove facts for these topological 
groups which indicate that much information can be obtained from a neigh¬ 
borhood of the identity element; this leads to local groups and local iso¬ 
morphisms. Next we consider topological subgroups, coset spaces, and normal 
subgroups. Finally, for connected topological groups, we show that any 
neighborhood of the identity actually generates the group as an abstract 
group. 


1. Basics 

In the next chapter, we shall apply the results of the preceding chapters to 
obtain elementary results on Lie groups. However, since a Lie group is a 
topological group, we shall briefly discuss this more general situation. 

Definition 3.1 A topological group is a set G such that: 

(a) G is a Hausdorff topological space; 

(b) G is a group; 

(c) the mappings G x G -► G : (x, y) -► xy and G-+G :x^x~ x are 
continuous, where G x G has the product topology. 
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Thus the set G has two structures—topological and algebraic—and they 
are related by property (c); that is, the group structure is compatible with the 
topological structure. 

The compatibility conditions in (c) are equivalent to the following single 
condition: 

(c') the mapping Cx G -> G : (x, y) -*• xy~ 1 is continuous. 

This condition holds for if (c) holds, then we have that G x G-+G x G: 

( x , y) -* ( x, j> -1 ) is continuous. Consequently the map G x G -> G : (x, y) -*• 
(x, y~ l ) -»xj> _1 is continuous. Conversely if (c') holds, then set x — e (the 
identity) to obtain y (e, y) -» ey~ x = y _1 is a continuous map. Also from 
xy = x(y _1 ) _1 the map (x, y) -* xy is continuous. 

We can express (c) in terms of neighborhoods as follows. For any x, y e G 
and for any neighborhood W of xy in G, there exist neighborhoods U of x and 
V of y with UV c W. Also for any neighborhood U of x -1 , we have U~ i = 
(a -1 : a e U) is a neighborhood of x. Thus replacing x by x -1 , we have if V is 
a neighborhood of x, then V~ x is a neighborhood of x _1 . 

Definition 3.2 Let G be a topological group and let aeG. Then the map 
L(a) :G-*G:x-*ax 

is called a left translation. Similarly the map R(a) : G -* G : x -> xa is called a 

right translation. 

It should be noted that the maps L(a) and R(a) for a e G are homeo- 
morphisms of G. Furthermore given any two points x, y e G, then the homeo- 
morphism L(yx~ v ) maps x onto y. In particular, there always exists a homeo- 
morphism which maps e e G onto any other element aeG and using this, we 
shall see many of the local properties of aeG are determined by those of e. 
Thus, for example, C/is a neighborhood of a e G if and only if U — Ha) V = a V 
where V is a neighborhood of e e G. 

Proposition 3.3 Let G be a topological space which is also a group. Then 
G is a topological group relative to these two structures if and only if: 

(a) the set { e } is closed; 

(b) for all a e G the translations R(a) and L(a) are continuous; 

(c) the mapping G x G -*• G : (x, y) -* xy~ l is continuous at the point 
(e, e). 

Proof Let a,beG with a / b. Then we shall find disjoint neighborhoods 
of a and b as follows. Since L(a) -1 = L(a -1 ) is continuous, L(a) is a homeo- 
morphism. Thus {a} = L(a){e } and {b} = L(b){e } are closed, and there exists a 
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neighborhood U of b such that U n {a} is empty. Otherwise if U n {a} is not 
empty for every neighborhood U of b , then since {a} is closed, b = a. Now 
using (b) the neighborhood U = bV where V is a neighborhood of e . Thus 
from (c) and using e = ee~ 1 we can find a neighborhood Wofe with WW ~ 1 c 
V. We shall now show that aW n bW is empty, and therefore G is Hausdorff. 
Thus suppose xeaW nbW. Then x = aw t — bw 2 with w t e W and therefore 

a = bw 2 ^ 1 e {bW)W~ 1 a bV c U 

which contradicts U n {a} being empty. 

Next we shall show that the topology and group operations are compatible 
by showing the map xy -1 is continuous at any point 

(a, b) e G x G. First we note that 7?(a) -1 = R(a~ x ) is continuous, and there¬ 
fore R(a ) is a homeomorphism. Now let W be a neighborhood of ab" 1 . Then 
a~ x Wb = R(b)L(a)~ 1 W is a neighborhood of e. Using the fact that (x, y) -► 
xj” 1 is continuous at (e, e), we let U and V be neighborhoods of e so that 
UV~ X c: a~ x Wb. Then we have for the neighborhoods aU and bV of a and b 
that 

(aUXbVy 1 = aUV~ 1 b~ 1 cz a(a- 1 Wb)b~ 1 = W, 

and this shows continuity. The converse follows from various preceding 
remarks and is left as an exercise. 

Lemma 3.4 Let U be a neighborhood of e in a topological group G. 
Then there exists a neighborhood V of e such that V c J7, V = K" 1 
(= {v~ x : v e V}) and VV = FK _1 c: JJ. We shall call such a neighborhood K 
of e symmetric. Furthermore, in this case V cz U where V is the closure of V. 

Proof Since multiplication G x G ->■ G is continuous, there exists neigh¬ 
borhoods P and Q of e in G such that PQ c U. Now let W= P n Q and 
V = W n W~ x . Then V is a neighborhood of e with V— V~ l and also 
VV cz PQ cz U. Next let x e V. Then xV is a neighborhood of x and conse¬ 
quently V n xV is not empty. Thus for some i?, i? x e V we have xt? = v t and 
therefore x = t?^" 1 e FK _1 cz £/. 

Exercise (1) Show in detail that the neighborhoods U and V in the 
above proof are such that F cz U. 

Definition 3.5 A subset of a topological group G which contains an 
(open) neighborhood of the identity e is called a nucleus of G. 

Now since the topology of a topological group G is determined by the 
family of neighborhoods at each of its points, we see by using the left or 
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right translations that the topology is determined by the family of nuclei of G 
as follows [Chevalley, 1946; Cohn, 1957]: 

Proposition 3.6 Let y be the family of all nuclei of a topological group 
G. Then y satisfies: 

(a) V u V 2 e y implies V t n V 2 e if; 

(b) V 2 e y and V 2 c. W <=■ G implies W e y ; 

(c) for any V 1 e y, there exists Kef such that VV~ l <= V l ; 

(d) if Ve y and a e G, then aVa -1 e y ; 

(e) f| {V:Vey} = {e}. 

Conversely, given a group G and a family of subsets y of G satisfying 
(a)-(e), then there exists a unique topology for G relative to which G becomes 
a topological group and y is exactly the family of nuclei for this topological 
group. 

Proof Properties (a)-(e) are immediate and the converse can be regarded 
as a straightforward exercise. For example, define the topology on G by 
saying that W is open if xe W implies there exists Kef and xV a W. The 
family of such sets W satisfies the axioms of a topology which makes 
G x G ^ G : (x, y) -+ xy' 1 continuous. This topology is Hausdorff since M 
is closed: If a # e, then a -1 # e and by (e) there exists Ve y with a~ l = 
a~ 1 -e$V. Thus e^aKso that a is not in the closure of {e} ; that is, {e} is 
closed. 

Examples (1) The matrix groups of the preceding chapters are topo¬ 
logical groups, as are discrete groups: Let G be any group and let the topology 
be such that every subset of G is open; that is, the discrete topology. Then G 
is a Hausdorff space, and since G x G has the product topology (which is 
discrete), any map G x G -> G is continuous. 

(2) The additive group of R with the usual metric topology is a topo¬ 
logical group. However, if the topology is changed to another topology where 
the half-open intervals [a, a + e) with e > 0 are taken to be a neighborhood 
basis at a e R, then the operation R -* R: x -*■ —x is not continuous at 0 e R 
so R, with this topology, is not a topological group. 

(3) Let G l and G 2 be topological groups. Then the product space 
G l x G 2 with the product topology and the pointwise operations 
(x u x 2 ){y 2 , y 2 y l = (x^r*, * 2 yi~ l ) becomes a topological group called the 
product group of G t and G 2 . This example can be generalized to the semidirect 
product of Gj and G 2 as follows [Hochschild, 1965]. Let <j> be a homomorphism 
of G 2 into the automorphism group of G, denoted by Aut(G',)- On the product 
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space (?2 X 6], put the product topology and require that the map G 2 x G x 
Gi : (y, x) —> <t>(y)x be continuous. Then G x x G 2 with the product topology 
and product defined by 

(*> y)(*u yi) = (*W>O0*iL yy x ) 

is a topological group called the “semidirect product” and denoted by 
Gi when the homomorphism $ : G 2 -► Aut(G t ) is such that <f>(y) is 

the identity, then we obtain the direct product. 

As an example of a product group we note that the unit circle S 1 = 
{e 2ntx : x e R} is a topological group relative to multiplication in the complex 
numbers. Then the torus T n = S 1 x * • * x S 1 as given in Section 2.1 can be 
regarded as a topological product group. 

Exercise (2) Let G = G x x <t> G 2 be the semidirect product as in the 
above example (3). 

(a) Show the multiplication of G is associative. 

(b) What is the inverse of ( x , y) in G? 

(c) Show H x = G x x^le} = {(x, e) : x e GJ is a normal subgroup of G. 

(d) Is H 2 = {e} x 4> G 2 necessarily a normal subgroup of G? 

Definition 3.7 A local group is a Hausdorff space N such that: 

(a) there is a binary operation in N, (x, y) -► xy which is defined for 
certain pairs (x, y) e N x N; 

(b) the operation is associative when defined. Thus if x, y, z e N and 
(xy)z, x(yz) e N, then ( xy)z = x(yz); 

(c) there exists an identity element e e N. Thus for all xe N, xe and ex 
are defined and xe = ex = x; 

(d) there exists an inverse operation in N, x->x~ l which is defined for 
certain elements xe N such that if x" 1 is defined, then xx _1 and x _1 x are 
defined and xx -1 = x _1 x = e; 

(e) the maps (x, y) —► xy and x x~ 1 are continuous where defined. 

Thus if xy — z is defined in N , then for any neighborhood U of z in N 
there exist neighborhoods V of x and W of y in N such that VW = 
{vw : ve V and we W} is defined and VW c= JJ. Similarly for the map 
x —> x A . 

Remark (1) Any open nucleus of a topological group is a local group, 
and we use local groups later in discussing Lie groups. 

Analogous to the proof of Lemma 3.4, we have the following result: 
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Proposition 3.8 Let N be a local group. Then there exists a neighborhood 
U of e in N such that: 

(a) ( xy)z = x(yz ) all x, y, z e U; 

(b) xx _1 = x _1 x = e all xe U; 

(c) ex = xe = x all xe U; 

(d) for all x e U we have x _1 e U ; 

where all the above products and inverses actually exist in N. A neighborhood 
U satisfying these conditions is called a germ of the local group N. 

Definitions 3.9 (a) The local groups N and N' are topologically iso¬ 
morphic if there exists a homeomorphism f:N^>N':x-+ f(x) such that the 
product xy is defined in N if and only if the product f(x)f(y) is defined in N' 
and in this case /(xy) = f(x)f(y). 

(b) The topological groups G and G' are locally isomorphic if they have 
open nuclei which as local groups are topologically isomorphic. 

Example (4) The topological groups R and the torus T 1 have neighbor¬ 
hoods N and N 1 of the respective identities which are topologically isomorphic 
as local groups. Thus R and T 1 are locally isomorphic but not isomorphic 
as groups. 

Remark (2) In the definition of a topological group, we assumed the 
topological space G to be Hausdorff. However, using the group structure of 
G , we can start with weaker separation axioms for G and obtain stronger 
separation theorems than being Hausdorff. Good accounts of these theorems 
can be found in the work of Hewitt and Ross [1963] and Montgomery and 
Zippin [1955], and we now summarize some of the results. 

Definitions (a) A topological space M is a T 0 -space if for any given pair 
of distinct points x, y e M, there exists an open set U of M which contains 
one of these points but not the other. A r 0 -topological group is a group G 
which is a J 0 -space and such that the map G x G -+ G : (x, y) -+ xy" 1 is 
continuous. 

(b) A metric (or pseudo-metric) d on a group G is left invariant (respec¬ 
tively right invariant) if for all a, x, y e G we have d(ax, ay) = d(x, y) [respec¬ 
tively d(xa , ya) = d(x , y)]. If d is both left and right invariant, then d is called 
two-sided invariant or just invariant. 

Theorem Let G be a r 0 -topological group. Then G is metrizable if and 
only if there is a countable (open) basis at the identity e e G. If this is the case, 
then the metric can be taken to be left invariant. 
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Corollary Let G be a r 0 -topological group such that every point of G 
has a neighborhood U so that U is countably compact; that is, every countable 
open covering admits a finite subcovering. Then G has a left invariant metric 
which yields the original topology if and only if {<?} equals the intersection of a 
countable family of open sets. 

Corollary Let G be a T^-topological group which is compact and such 
that {e} equals the intersection of a countable family of open sets. Then G has 
an invariant metric which yields the original topology of G. 

Definition A topological space M is a 7,-space if for distinct points 
x # y in M, there exists an open set U with xe U but y i U, and there exists 
an open set V with y e V but x^V. 

Exercise (3) Show that a T 0 -topological group is a 7',-space. 

Definitions Let M be a 7\-space. Then: 

(a) M is regular if for every closed set F in M and every x £ F, there 
exist disjoint open sets U and V such that xeU and F<= V; 

(b) M is completely regular if given any x e M and given any closed set F 
with x£ F, there exists a continuous function g:M->[ 0,1] such that 
g(x) = 0 and g(F) = 1; 

(c) M is normal if for every pair of disjoint closed sets F l and F 2 in M, 
there exist disjoint open sets U 1 and U 2 of M such that F { c U 1 and F 2 c U 2 ; 

(d) M is paracompact if every open covering of M has a locally finite 
refinement. Recall that an open covering °U is locally finite if for each xeM, 
there exists an open set V(x) which contains x and such that 
{Ue \ U o V(x) ^ <j>} is a finite set. Also recall that a locally finite refine¬ 
ment means that if 6 U is any open covering, then there exists a locally finite 
open covering i r such that VeF~ implies there exists U e % with V <= U. 

We have (d) implies (c) implies (b) implies (a). 

Theorem Let G be a J^-topological group. Then G is completely regular 
and consequently Hausdorff. 

Theorem Let 6 be a locally compact T 0 -topological group. Then G is 
paracompact and consequently normal. 

There exist r 0 -topological groups which are not normal topological spaces 
[Husain, 1966]. 
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2. Subgroups and Homogeneous Spaces 


We shall consider subgroups H of a topological group G and the cor¬ 
responding space of left cosets G/H = {aH: aeG}. Then we eventually con¬ 
sider the case when if is a normal subgroup so that G/H becomes a topological 
group. 

Definition 3.10 Let G be a topological group and let H be a subset of G 
such that HH _1 c H. Then H is a subgroup of G (in the abstract sense). The 
topology of G induces a topology on the subgroup H by requiring U c= H to 
be open if and only if U = H n V where V is open in G. If with this induced 
topology, the subgroup H becomes a topological group we call H a topological 
subgroup. 

Remark (1) We shall be interested in closed subgroups H of G ; that is, 
H is closed as a subset of G. However, we note that if H is an open subgroup of 
G, then H is closed. For, since H is open, so is aH for all aeG. Therefore 
K = (J {aH : a $ H) is open so that the complement of K, which is H, is 
closed. 

Exercise (1) If H is a subgroup of the topological group G, then show 
its closure H is also a subgroup of G. More generally, one can show that for 
subsets A, B of a topological group G that A B a AB , (A)' 1 — ( A -1 ) and 
a Ah = a Ab for all a, be G. 

Theorem 3.11 Let H be a topological subgroup of the topological 
group G and let n : G -► G/H : a aH be the natural projection. Then 

(a) G/H can be made into a topological space such that: 

(i) the projection n : G -» G/H is continuous, and 

(ii) if iV is a topological space and if /: G/H^N is such that /° n : 
G TV is continuous, then / is continuous. 

The topology defined on G/H is uniquely determined by (i) and (ii) and is 
called the quotient topology. 

(b) G/H with the quotient topology is such that n is an open map; that is, 
U is open in G implies n(U) is open in G/H . 

(c) The quotient topology is Hausdorff if and only if H is a closed subset 
of G. 

Proof We define the topology on G/H by requiring a subset U of G/H 
to be open in G/H if and only if the inverse image n~\U) is open in G. Then 
the axioms for open sets of a topology in G/H are satisfied. Furthermore, if U 
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is open in G/H , then by definition n~ x (U) is open in G so that n is continuous. 
To prove (ii), let/° n : (7 -► N be continuous and let W be open in N. Then 
(f° rc) -1 (J*0 = 7C_1 (/ _1 (^ / )) is open in G. Thus by definition of the quotient 
topology, f~ x (W) is open in G/H so that/is continuous. 

Let be the topology given above on G/H and let 3” be any other topo¬ 
logy on G/H satisfying (i) and (ii). Let / be the identity map of G/H with 
topology 2T onto G/H with topology 3T '. Then fo n is continuous since 
fo n(a) = aH = n(a ) and satisfies (i) for n : G -► G/H. Thus since G/H 
with topology 2T satisfies (ii) where N = (G/H, we see that / is contin¬ 
uous. Interchanging the roles of ZT and 2T', we obtain/ _1 is continuous so 
that the identity map / is a homeomorphism; that is, ST — . 

Next, to show that n is an open map, we note that if U is open in G, then 
Ua is open for all as G. Consequently UH = (J {Ua : a e H] is open in G . 
However, since UH = n~ x [n(U)}, we have by the definition of the quotient 
topology that n(U) is open in G/H. 

For (c), we note that G/H being Hausdorff yields the fact that [eH] is 
closed in G/H. This implies H = n~ x {eH} is closed in G because n is contin¬ 
uous. Conversely, suppose H is closed in G and let aH ^ bH in G/H. Then 
a $ bH. Thus since bH is closed, there exists a neighborhood U of e in G such 
that Ua is a neighborhood of a in G and Ua n bH is empty. From Lemma 3.4, 
there is a neighborhood Fof e such that V~ X V c= U and consequently ( Va)H = 
V(aH) and V(bH) are neighborhoods of aH and bH, respectively. This uses 
n : G -► G/H as an open map and Va and Vb as neighborhoods of a and b in 
G. These neighborhoods are disjoint, for if p e V(aH) n V(bH ), then p = 
vah = v t bh x for v, e V and h, h x e H. Therefore q = v1 x va = bhj\~ x is in 
Ua n bH, a contradiction. 

Definition 3.12 A subgroup H is a normal subgroup of G if aHa" 1 c H 
all a e G and then G/H is a group relative to aH ■ bH — abH which is called 

the quotient group. 

Corollary 3.13 Let H be a closed normal subgroup of the topological 
group G and let G/H be the quotient group. Then relative to the quotient 
topology, G/H becomes a topological group such that the projection n : G -> 
G/H i s an open continuous homomorphism. 

Proof It suffices to show that 

G/H x G/H -> G/H : (aH, bH) -^ab~ 1 H 

is continuous. Let U be a neighborhood of ab~ l H= n(ab~ 1 ) in G/H . Then 
7i~ x (U) is a neighborhood of ab~ x in G. Now there exist neighborhoods Vo fa 
and W of b in G such that VW~ X c n~ 1 (U). However, since n is open, n(V) 
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and n(W) are neighborhoods of aH = n{a ) and bH = n(b), respectively. Thus 
n(V)[nilV)]* 1 = <= U which proves continuity. 

Corollary 3.14 If H is an open normal subgroup of the topological group 
G, then G/H is discrete. 

Proof Since H is open, the cosets aH for a e G are open in G. Thus since 
n is an open map, the sets {aH} in G/H are open. Therefore G/H is discrete. 

Corollary 3.15 Let/: G -» G be a homomorphism of topological groups. 
Then / is continuous if and only if /is continuous at the identity e e G. 

Proof Assume/is continuous at the identity. Let a e G and let f(a)U be 
a neighborhood of /(a) in G where U is a neighborhood of e in G. Since/is 
continuous at e e G and since e = f(e ), there exists a neighborhood £/ of e in 
G such that/([/)cEJ which proves continuity at a since at/ is a neighborhood 
of a in G with f(aU) czf(a)U. 

Using the preceding results with the isomorphism theorem for groups, we 
have the following result which we leave as an exercise [Cohn, 1957]. 

Theorem 3.16 Let f :G->G be a continuous homomorphism of the 
topological groups G and G and let H = {x e G :f(x) = e} be the kernel off 
where e is the identity of G. Then: 

(a) H is a closed normal subgroup of G and n : G -► G/H is a continuous 
homomorphism; 

(b) there is a continuous monomorphism g : G/H ^ G such that/ = g ° ft; 

(c) let 7/ and A be closed normal topological subgroups of G such that 
N a H. Then G/H is topologically isomorphic to ( G/N)/{H/N ). 

Example (1) Let R be the additive group of the reals with the usual 
metric topology and let Z, be the additive subgroup of the integers. Then Z is 
closed in R and the quotient group is topologically isomorphic to the multi¬ 
plicative group of complex numbers of absolute value 1 denoted by T 1 or S 1 ; 
that is, the one-dimensional torus. Then R/Z is frequently called the (one¬ 
dimensional) torus group or the group of reals modulo 1. The above iso¬ 
morphism uses the fact that the map /: R - S 1 : x - is a continuous 
epimorphism with kernel Z. Thus, from Theorem 3.16, S 1 is topologically 
isomorphic to R/Z. 

Exercise (2) Generalize the above example by finding an explicit homo¬ 
morphism /: R n -► T n of the topological groups. What is the kernel of this 
map? 
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Definition 3.17 Let M be a Hausdorff topological space and let G be a 
topological group. Then: 

(a) G operates on M if there is a surjection 

G x M -► M : (g, p) -► # • p 

such that Oi£ 2 ) ‘ P = ' (#2 \P) and e • p = p for all g l9 g 2 eG and p e M 

where e is the identity of G. 

(b) G operates transitively on M if for every p,qe M, there exists g e G 
such that g • p = q. 

(c) G operates continuously on M if the map G x M -► M : (g, p) ~> g • p 
is continuous. 

(d) G is called a topological transformation group on M if G operates 
continuously on M. [Note that for each g e G, the map z(g) : M-+ M : p-^g • p 
is a homeomorphism.] 

(e) G is effective if a • p = p for all p e M implies a = e. 

(f) Let P be fixed in M. Then G(p) = {g e G : g • p = p) is a group called 
the isotropy subgroup of G at p or fixed point subgroup at p. The set G * p = 
{# * p g M: # e G} is called an orbit under G. 


Exercise (3) If G acts transitively on M, then for given p, q e M the 
isotropy subgroups G(p) and G(q) are conjugate in G. 


Example (2) Let G be a topological group and let H be a closed sub¬ 
group. Then the space M = GjH is a Hausdorff space according to Theorem 
3.11 and G operates continuously on M by the map G x M ^ M: (g, xH) -► 
(gx)H. For each g e G the map z(g) : M -► M : xH -► gxH is a homeomorph¬ 
ism, and using this, we see G acts transitively on M. The coset space M — GjH 
is called a homogeneous space. 

Exercise (4) Show that G is effective on GjH if and only if H contains 
no proper normal subgroup of G. 

Theorem 3.18 Let M be a Hausdorff space and let G be a transitive 
topological transformation group operating on M. Let p be some (fixed) 
point in M and let G(p) be the isotropy group at p. Then G(p) is a closed 
subgroup of G, and the map f:G-+G-p:a^>a-p induces a continuous 
bijection /: G/G(p) M such that Jo n =f; that is, the accompanying dia¬ 
gram is commutative. 
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G - > M 



GIG(p) 


Proof First, since G is transitive, the orbit G * p equals M . Next, since 
the map/: G M: a - p is continuous and {p} is closed in the Hausdorff 

space M, we have G(p) =/ _1 (p) is closed. 

Now note that for a e G and h e G(p) we have f (ah) = ah * p = a • (h * p) = 
a • p — f(a). Thus the map 

/: G/G(p)^M:aG(p)->f(a) 

is actually a well-defined function, for if aG(p) = bG(p), then b = ah for some 
h e G(p) and therefore f(b) = /(tf/0 = /(«)• Next / is bijective since f(a) =f(b) 
implies a~ l b * p = p and therefore a~ 1 be G(p ); that is, aG(p) = bG(p). 
Finally, since we clearly have f = f and since/: G —► Mis continuous, we 
have from Theorem 3.11 that / is continuous. 

Corollary 3.19 If/: G/G(p) -> M is open or if G/G(p) is compact, then / 
is a homeomorphism; that is, M is a homogeneous space. 

Proof If / is open, then by definition of continuity / 1 is continuous. 
If G/G(p) is compact, then we use the following general results: Let S be a 
compact space and let T be a Hausdorff space. Then any continuous bijection 
g : Tis a homeomorphism [Singer and Thorpe, 1967, p. 24]. 

Remark (2) If G is compact, then M is a homogeneous space, for in 
this case G/G(p) is compact since n : G -► G/G(p) is continuous. Note there is 
almost a converse statement: If H is a closed compact subgroup of G such that 
G//f is compact, then G is compact [Chevalley, 1946, p. 31; Hochschild, 1965, 
p. 8]. 

Corollary 3.20 If G is a locally compact group with countable basis and 
if M is a locally compact Hausdorff space, then / is a homeomorphism of 
G/G(p) onto M. 

The proof of this can be found in the work of Helgason [1962, p. 111]. 

Example (3) For n > 2, let 5” -1 = [x e R n : |MI = 1} be the unit sphere 
where || jc|| 2 = B(x, x) is the usual inner product on R n . Let G = 0(n) = 
{A e GL(n, R): B(Ax, Ax) = B(x, x) all x e R n } and let p = (1,0,..., 0) e S" K 
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Then 0(ri) operates continuously and transitively on S n 1 by the map 
0(n ) x S "- 1 5 n_1 : (A, x ) -> Ax. 

From this we see that Ap = p if and only if 



where B e 0(n - 1) so that we obtain G(p) = 0(n - 1). Thus using 0(n) is 
compact [exercise (5) below], its continuous image 0(n)fO(n — 1) is compact 
and therefore 0(n)/0(n — 1) is homeomorphic to S"" 1 . 

In the work of Chevalley [1946, p. 32] it is similarly shown that some of 
the other groups discussed in Section 2.3 also yield homogeneous spaces 
which are homeomorphic to spheres. For example, if n > 2, then we also have 
SO(n)jSO(n — 1) is homeomorphic to S”" 1 . 

Exercise (5) Show 0(n) is compact possibly as follows. First using the 
above representation of 0(ri) in terms of B, show 0(n) is a closed subset of 
End(. R n ). Next using A A* = /, show Q(n) is bounded; thus it is compact. 


3. Connected Groups 


In this section we shall show that much of the topology and many other 
relations are determined by the connected component of the identity element 
of a topological group. 

Definition 3.21 Let M be a topological space and let p e M. Then p is 
contained in a unique maximal connected subset C(p). This set C(p) is closed 
and is called the connected component of p. For M — G a topological group, 
the connected component of the identity e e G is called the identity component 
of G and is denoted by G 0 . 

Theorem 3.22 Let G be a topological group, and let G 0 be the identity 
component. Then: 

(a) G 0 is a closed normal topological subgroup of G and the connected 
component C(a ) of a e G equals aG 0 ; 

(b) If G is locally connected (that is, if every point aeG has a connected 
neighborhood), then G/G 0 is discrete. 
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Proof First, since G 0 is the component C(e), we have G 0 is closed. Next 
let aeG 0 . Then since multiplication is continuous, a~ 1 G 0 is connected and 
contains e. Thus a~ 1 G 0 is a connected set containing e , and since G 0 is the 
maximal connected set containing e, we have a~ x G 0 c= G 0 so that G 0 is a 
subgroup. Also for any x e G we see x~ x G 0 x contains e and is connected. 
Therefore x~ x G 0 x cz G 0 so that G 0 is normal. 

Next since x -* ax is a homeomorphism of <7, then aG 0 is connected and 
contains a so that a<7 0 <= C(a). Also a _1 C(a) is connected and contains e so 
that a~ x C(d) c: G 0 ; that is, C(a) = aG 0 . 

For (b), let G be locally connected so that there is a connected neighborhood 
U of e in G . Then since n is an open map, n{U) is a neighborhood of eG 0 in 
G/G 0 . However, since U is connected we have U c= G 0 so that 7r(C/) = {e<7 0 }. 
Thus {eG 0 } is open so that G/G 0 is discrete. 

Proposition 3.23 Let G be a topological group, let G 0 be the identity 
component, and let U be any open neighborhood of e e G. 

(a) If U is a symmetric neighborhood, then H = (j£L i U k i s an °P en and 
closed subgroup of G. If U is connected, so is H. 

(b) G 0 -(U?-i U k )nG 0 . 

(c) If G is connected, then G = (j£Li U k . Thus any open neighborhood 
of e is a set of generators of a connected topological group as an abstract group. 

Proof (a) If U is symmetric, then for x e U m , y e U n , we have xy e u m+n 
and x~ x e(t/ -1 ) m = £/ m so that H is a subgroup. Next since V is open, 
U 2 = (J {at/: a e U] is open and by induction U k is open. Thus H is an open 
subgroup. However, from remark (1) of Section 3.2, H is also closed. If U is 
connected, then so is each U k and therefore H is connected (using e e U k ). 

(b) Let V be a symmetric neighborhood of e such that Fc(/ and let 
W = V n G 0 . Then IF is a symmetric neighborhood of e in G 0 and H = 
(J W k ci (J V k n G 0 . However, H is a nonempty open and closed subgroup 
of G 0 and since G 0 is connected we have G 0 = H. Since (J V k c (J U k , we 
obtain the result. 

(c) Part (c) follows from (b) since G = G 0 . 

Definition 3.24 Let G be a topological group. Then the center C of G 
equals {xe G : xa = ax for all aeG}. The center is a normal subgroup of G 
and is also denoted by Z(G). 

Proposition 3.25 Let G be a connected topological group and let H be a 
discrete normal topological subgroup of G. Then H cz C, the center of G . 




104 


3. TOPOLOGICAL GROUPS 


Proof Let aeH. Then the map G H : x -> x~ l ax is continuous. 
However, since G is connected and H is discrete, the image is a single point 
pe H. In particular p = e~ 1 ae = a so that x~ 1 ax = a for all x e G; that is, 
1/cC. 

Proposition 3.26 Let G be a topological group and let H be a closed 
topological subgroup such that H is connected and G/H is connected. Then G 
is connected. 

Proof Let H and G/H be connected and assume G = U u V where U and 
V are nonempty open sets. The open map n: G^> G/H maps U and V onto 
open sets U x = UH and V t = VH in G/H. Since G — U u V, we have G/H = 
U 1 u V u and since G/H is connected, there exists aH e ^ n = C/Z7 n 17/. 
Thus aH e UH yields he H with ah = ue U. Thus aH n U is not empty. 
Similarly aH n F is not empty. However, since G = U u F, we have aH = 
(a// n 17) u (a77 n K), and since H is connected and aH is homeomorphic to 
H, we have aH is connected. Thus {aH n U) n {aH n F) is not empty. This 
implies U n V is not empty so that G is connected. 

Definition 3.27 A topological space M is locally Euclidean of dimension 
m if each point pe M has a neighborhood which is homeomorphic to an open 
set in 7£ m . Note that an open subset of R m cannot be homeomorphic to an open 
subset of R n if m # n. 

Examples (1) The torus T 1 = R/Z is connected and R is connected, but 
Z is not connected. Thus the connectedness of a subgroup H cannot be 
deduced from that of G and G/H. Also note T 1 is locally Euclidean of dimen¬ 
sion 1. 

(2) Using the result that the sphere S n ~ 1 is connected and from Section 
3.2 that SO{ri)/SO{n — 1) is homeomorphic to S'”' 1 , we shall show SO{n ) is 
connected. Consequently, since SO{n) is actually a C°°-manifold, we have 
from Proposition 2.15 that SO{n) is path connected. First note that 5G(1) is 
just the identity linear transformation I. Assume SO{n - 1) is connected. Then 
since SO{n)/SO{n — 1) is connected (because it is homeomorphic to S'" -1 ) we 
have by Proposition 3.26 that SO{n) is connected. Also we previously noted 
that 0{n) is not connected, but note that SO{n) is the identity component of 
0{n) and the order of 0{ri)/S0{ri) is 2. Thus 0{n) has two components, one 
which consists of matrices of determinant — 1 and the other is SO{n). In a 
similar manner, it is shown by Chevalley [1946, p. 36] that various other 
matrix groups are connected. 



CHAPTER 4 

LIE GROUPS 


We now discuss some elementary results of Lie groups which can be easily 
done without introducing the Lie algebra. First we see that a Lie group is a 
topological group which is an analytic manifold so that there is compatibility 
between the topological, manifold, and group structures. Next we give results 
which tell when a topological group is a Lie group and when a local group 
generates a Lie group. Finally we discuss Lie subgroups and when an abstract 
% subgroup can be considered as a Lie subgroup. 


1. Basic Structures 


In this section we give the basic definitions for a Lie group and show how 
the analytic structure of a Lie group is uniquely determined. 

Definition 4.1 A Lie group is a set G such that: 

(a) G is a group; 

(b) G is an analytic manifold; 

(c) the group multiplication in (a) of the product manifold 

fi: G x G G : (jc, y) xy 
and the group inversion operation in (a) 

i: G->G : x-> x~ x 
are analytic functions relative to the structure in (b). 
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Remarks (1) A Lie group is, in particular, a topological group relative 
to the topology induced by its analytic structure, and the question arises 
when is a topological group actually a Lie group. This is discussed by 
Montgomery and Zippin [1955, p. 184], and among the many results is the 
following: A connected locally Euclidean topological group is isomorphic to 
a Lie group. 

(2) The fact that i: GG : x -> x~ x is analytic follows from (a) and 
(b) and p : G x G-> G is analytic by using the implicit function theorem. 
Briefly, let ( U , x) be an analytic chart at e in G and for u, v in a suitable open 
nucleus V <=z U 9 we have that x { ° p(u , v) = p^u, v) defines an analytic function 
with e) = fii(e, v) = v t . Thus D^x <=> p)(e 9 e) = (dpjdxfa e )) = (5 V ). By a 
variation of the implicit function theorem, the equation x ° p(z 9 v) — *(e) has 
a solution z — 9(v) in some neighborhood of e where 6 is actually analytic; 
that is, 0 t = Xi o 0 are analytic. However, zv — e has the solution z = v~ l so 
that the map v -► v~ i is analytic at e. Now by using the analyticity of the left 
and right translations (from pi: G x G -*■ G is analytic), we have i is analytic 
on all of G. 

Examples (1) The matrix groups GL(V), SO(n , R ), etc. previously con¬ 
sidered are Lie groups. Also the torus T 1 and more generally T” is a Lie group. 
For this, we use x -> e 2nix is analytic and so is the multiplication 
pt(e 2nix 9 e 2ltiy ) = e 2 ltl ( x+ y\ Next use the fact that if G u ..., G n are Lie groups, 
then G t x • • • x G n with the product group and analytic structure is again a 
Lie group which we leave as an exercise. 

(2) If G is a discrete topological group, then e has the open neighborhood 
{e} which is homeomorphic to R° = {0}; that is, a discrete topological group 
can be considered as a zero-dimensional Lie group and conversely. 

(3) Let R denote the manifold of the real numbers with the usual co¬ 
ordinate u: R^> R : t and define 

p : R x R -► R : (x, y) -► (x 3 + y 3 ) 1/3 . 

Then (R, p) is a topological group but not a Lie group, since p is not analytic 
at (0, 0) relative to the above coordinate. 

Exercise (1) Show that R with the above multiplication p becomes a 
Lie group relative to the analytic structure on R given by v : R-* R : tt 3 . 

We now consider the existence and uniqueness of an analytic structure 
determined by a nucleus. 

Proposition 4,2 Let G be a connected topological group with multiplica¬ 
tion p(s 9 t) = st , let ( U , x) be a chart at e in G, and let V U be an open 
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nucleus such that /x(F, V) <= U. If the function x ° p is analytic on V x F, then 
there exists a unique analytic structure s# on G which makes G into a Lie 
group ^ such that (W, x)e s/ where W is a suitable open nucleus contained 
in U . In this case the topology induced by the analytic structure on ^ equals 
the original topology of G. 

Proof First we define the analytic structure s/ on G. Let the nucleus V 
be as above and let W be an open nucleus such that WW~ X c: V. Then since 
eeW we note W c V cz U and for any a e G we have 0 W is a neighborhood of 
a in G. Now (IF, x) defines an analytic chart at e e G and we now define 
coordinates on the neighborhood a W by 

y : aW -► x(W) : au-+ x(u). 

Thus since x(W) is open in R m , we obtain a chart (aIF, y), where for peaW 
we have y(p) = x ° L(a) -1 (p). Furthermore since x : W -+ R m is analytic so is 
y : aW-+ R m . 

We shall next show that these charts a IF with a e G are actually analytical¬ 
ly related. Thus they form an analytic atlas & which covers G, and we obtain 
an analytic structure by taking the maximal atlas st which contains So 
suppose (a IF, y) and (6IF, y) are charts, and let p e aW n bW. Then we must 
show y o y _1 and joy -1 are analytic. We have from p = p(a 9 u) = aueaW 
and p= bvebW that = IFIF~* cz V. Therefore the map 

L{b~ 1 a) : F-+ £/: z -► p(b~ 1 a, z) is analytic on Fby hypothesis. Next we have 
from y = x o L{a)~ x and y = x ° Lib)' 1 that 

y 0 y _1 = * o L(bY x o L(a) ° x _1 = x © L(b~ x a) o x _1 

is an analytic function from x(IF) cz into jc(G) c= i? w . Similarly y o jr 1 is 
analytic. 

By definition, the chart (IF, x) is in the analytic structure s4 defined above. 
Let ^ denote the analytic manifold G with the analytic structure stf . Then by 
definition of stf the map L(a' 1 ) is an analytic diffeomorphism which maps a 
neighborhood of a e & onto a neighborhood of e e Consequently, since the 
multiplication p is analytic at ( e , e) e & x it is analytic on all of ^ x 
that is, ^ is a Lie group; (see exercise (2) below). 

The analytic structure s/ on the Lie group ^ is unique, since it is completely 
determined by the given chart (U 9 x) at e e G. Thus if another open nucleus 
W x c= U determines an analytic structure s4 x on G using the coordinate map 
x; then by considering IF n W u we see that the identity /: ^ -► and its 
inverse /: are both analytic, where is the group corresponding to 

the analytic structure on G. Finally the map /: G ^ is a homeomorphism 
at e and therefore everywhere; that is, the topology induced by the analytic 
structure on ^ equals the original topology on G. 
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Definition 4.3 Let G and G x be Lie groups. Then the map <p : G G t is 
an analytic homomorphism if (j) is an analytic mapping which is a homo¬ 
morphism of the groups. Thus </> is an analytic isomorphism if <j> is an analytic 
homomorphism such that (j)~ i exists and <j) and <j)~ x are both analytic 
(diffeomorphisms). 

Remarks (3) From Proposition 4.2, we see that in a topological group 
a chart at e e G with local analytic multiplication determines a unique Lie 
group structure. Now suppose that two charts at e are given such that the 
multiplication g is analytic relative to these charts. Then possibly two distinct 
analytic structures and s0 2 can be determined which give rise to two Lie 
groups and . However, we shall show that this is not possible by eventual¬ 
ly showing continuous isomorphisms of Lie groups are analytic isomorphisms. 
Thus the analytic structure is completely determined by the topology. 

Exercise (2) Show in detail that the multiplication in the proof of 
Proposition 4.2 is analytic at any point (a, b) e $ x ^possibly as follows. First 
show inversion i: ^ -> ^ : x -► x~ l is analytic at e . Using this and ^ is con¬ 
nected, show for any a e & that R(a ) = i © L(a~ x ) © i is analytic at e. Next 
observe the map oc x ft :<$ x & x & : (x, y)-> ( a~ 1 x, yb~ x ) is analytic and 
use the factorization jx = R(b) © L{a) © fi o (a x /?) and jU is analytic at (e, e) 
to show fi is analytic at (a, b). 


2. Local Lie Groups 


We now consider local groups which are manifolds and use these to also 
determine Lie groups. Since we are considering locally Euclidean groups, we 
shall henceforth assume nuclei to be connected. 

Definition 4.4 A local Lie group is a set B such that: 

(a) B is a connected analytic manifold; 

(b) B is a local group relative to the topology induced from the analytic 
structure in (a); 

(c) there is an open germ U of the local group B such that for the multi¬ 
plication function fi in B , the map ji: U x U -> B : ( jc , y) -> n(x, y) is analytic. 

Remark (1) As discussed for Lie groups, a local Lie group defines an 
analytic inverse operation i : W-* W : x~> x~ l defined on a suitable open 
neighborhood W of e in U. 
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(2) An analytic vector field yields a local one-parameter group cj) on an 
analytic manifold M and </> is a local Lie group. 

(3) An open nucleus of a Lie group is a local Lie group. 

Definition 4.5 (a) Two local Lie groups, B x and B 2 , are locally analyti¬ 
cally isomorphic if there exists a local isomorphism f:B 1 -+B 2 such that / 
and / _1 are analytic. 

(b) Two Lie groups G and G are locally isomorphic if they have open 
nuclei which as local Lie groups are locally analytically isomorphic. 

Example (1) An open neighborhood 2? x of e in the torus T 1 and an open 
neighborhood B 2 of 0 e R yield locally analytically isomorphic local Lie 
groups, but T 1 and R are not isomorphic Lie groups. 

Related to Proposition 4.2 is the following result [Cohn, 1957]: 

Proposition 4.6 Let G be an abstract group and let B be a subset of G 
such that: 

(a) B generates G as a group; 

(b) B is a (connected) local Lie group relative to the multiplication in G . 

Then there is defined on G exactly one analytic structure s4 which makes G 
into a connected Lie group ^ so that: 

(i) the group structure of ^ is the group structure of G; 

(ii) for some open nucleus U of B with coordinate map x, the chart 
(■U , x) e st. 

Corollary 4.7 Let G be a connected topological group which is a (top¬ 
ological) manifold and let (j) : G x G -> G : (x, y) -> xy~ l [= p(x, y~ 1 )]. Let U 
be an open neighborhood of e which has a given analytic structure so that the 
map n(U x U) U : (x, y) -> xy~ l is analytic. Then there exists a 

unique analytic structure s4 on G relative to which G becomes a Lie group 3? 
such that the topology of G equals the topology of # and the analytic structure 
of & restricted to a suitable open nucleus V <= U is equivalent to the given 
analytic structure on U. 

Proof (of Proposition 4.6) First we define the topology on G as follows. 
Let be the family of nuclei of B, and let H r — {W <= G : W n B e i r }. Then 
ij/~ is nonempty, since 'V c= Hf and iV satisfies the conditions of Proposition 
3.6 as follows. Conditions (a) and (b) are clear. For (c), let W l e iV' so that 
W x n B e if. Then there exists Kef c: ii r with VV~ X ci W 1 n B c= W t . For 
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(d), let W e'W and a e G. Then since B generates G as a group, we can write 
a = a x • * ■ a n where a, or of 1 is in B. Now since a^ea-, — eand W is a neighbor¬ 
hood of e , we have by continuity of the multiplication that there exists V n g tT 
with Similarly there exists V n ~ x e'f" such that 

a n-iK-i#n-i c K> and by induction, there exists V k e 'V such that 
a k l V k a k £ V k+X for k = 1,1. Thus 

a -1 Fja = o„ -1 • • • or 1 *Vi ‘' * 4, 

co" 1 • • • oJ 1 F 2 a 2 • • • o„ 

c ai'V.a.cW. 

Thus V x c alTa -1 , and since V x e 'V <= t^, we have by condition (b) applied 
to iV' that aWa -1 g Condition (e) holds for iV', since it holds for Y'. Thus 
by Proposition 3.6, G becomes a topological group with iT as a family of 
of nuclei. 

Now the topology defined on G by W- restricts to the original manifold 
topology given on B. For f c# and, conversely, any W e iV' contains a set 
W n B which is in iT. Therefore IF is a nucleus of G. Thus the topological 
group operations on (G, iT) are actually analytic near e so that by Proposition 
4.2, G can now be defined to be a Lie group ^ [noting by Proposition 3.23(a), 
B generates G so that G is connected]. 


3. Lie Subgroups 

We shall now define the concept of a Lie subgroup of a Lie group G and 
note that this concept differs from a topological subgroup because a Lie 
subgroup need not have the induced topology. 

Definition 4.8 Let G be a Lie group and let H be a Lie group. Then H is a 
Lie subgroup of G if H is an analytic submanifold of G and if H is a subgroup 
of G. 

Examples (1) The torus T 2 is a Lie group when regarded as a product 
group r 1 x T 1 and has as a submanifold the “ irrational wrap around ” curve 
as discussed in Section 2.3 (see Fig. 4.1). This curve is given by f(t) = 
(exp 2niat , exp 2nibt), where ajb = a is irrational. As we saw, this curve is a 
one-dimensional submanifold which is dense in T 2 , and since f(s + t) = 
mm in T 2 , it is therefore a Lie subgroup which is not closed. This Lie 
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subgroup does not have the induced topology since there are points on the 
curve which are arbitrarily close in the topology induced from T 2 but are 
arbitrarily far apart in the topology of the curve. 

(2) The integers Z are a zero-dimensional Lie subgroup of R. 

Exercise (1) Every discrete subgroup of a Lie group is a closed Lie 
subgroup (Hint: {e} is a neighborhood of e.) 

We shall now give various criterion for a subgroup to be a Lie subgroup 
and we need the following result [Helgason, 1962, p. 78]. 

Lemma 4.9 Let M and N be C 00 (analytic) manifolds, and let f:M->N 
be a C 00 (analytic) mapping such that f(M) is contained in a submanifold P . 
If the map f.M^P is continuous, then this map is also C 00 (analytic). 

Proof We shall show this as follows. Let the accompanying diagram be 

M N 



P 


commutative, where/is C 00 , F continuous, and i an immersion (since P is a 
submanifold). Then by the remarks following Proposition 2.23, for each p eP 
there is a neighborhood U of p, and a neighborhood V of i(p) e N and a 
C°°-map g : V U so that g o / = identity | U. Thus since / = i ° F, we have 
locally that 

F = identity °F=g°(i°F) = g°f, 

and since the right side is a composition of C®-functions, F is C 00 . In partic¬ 
ular, letting F=f and i be the identity; that is, letting P be a submanifold, we 
obtain the result. 
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Proposition 4.10 Let G be a Lie group and let H be a submanifold of G 
which is also an abstract subgroup of G. If H is also a topological group 
(relative to the topology induced from its analytic structure), then H is a Lie 
subgroup of G. 

Proof It suffices to show H is a Lie group and this follows from the 
preceding lemma. The mapping/: G x G -* G : (x, y) -* xy~ 1 is analytic and 
its restriction f H : H x H -* G is also analytic. Now since H is a topological 
group, the map f H : H x H —> H is continuous. Thus by Lemma 4.9, f H is 
analytic so that H is a Lie group. 

The next result follows from previous facts. 

Proposition 4.11 Let G be a Lie group and let H be a connected topo¬ 
logical subgroup of G. Then there is at most one analytic structure on H 
which makes H into a Lie subgroup of G. 

We now give some computational results which determine Lie subgroups. 

Proposition 4.12 Let H be a Lie group which is an abstract subgroup of 
the Lie group G. Assume at the identity e e G there exist an analytic chart 
(£/, x) in G and an analytic chart (V, y) at e in H such that x t \H = y t and 
(i dyjdxjie )) has rank equal to the dimension of H. Then H is a Lie subgroup 
of G. 

Proof We first translate the charts at e to any point a e H by the analytic 
diffeomorphisms L(a) (of H and G) so that we can now apply Corollary 2.12 
to obtain H is a submanifold. 

This result can also be stated in terms of local Lie groups which generate 
a subgroup. 

Corollary 4.13 Let G be a Lie group and let B be a local Lie group relative 
to the group operations in G. If there exist charts (G, x) at e in G and (V, y) at 
einB such that x t | B = y t and rank {dyjdx,{e)) = dim B y then the subgroup H 
generated by B is a connected Lie subgroup of G. 

We now note that the topological and manifold structure is mostly in the 
identity component. This will also become more evident when the Lie algebras 
are also taken into consideration. 
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Proposition 4.14 Let G be a Lie group and let G 0 be the identity compo¬ 
nent of G (as a topological group). Then: 

(a) G 0 is an open normal Lie subgroup of G; 

(b) T(G, e) — T(G 0 , e) and therefore dim G = dim G 0 ; 

(c) G/G 0 is a Lie group which is discrete. 

Proof Since G is locally Euclidean, it has a connected open neighborhood 
U of e in G and from Proposition 3.23, U generates a connected subgroup H of 
G. Since G 0 is the identity component, we have by maximality that G 0 H. 
However, H contains the neighborhood U of e in G 0 and G 0 is connected. 
Therefore G 0 — H. Now G 0 contains the neighborhood U of e and U is open 
in G so that any a e G 0 is in the open neighborhood aU. Thus G 0 is open in G. 
This means G 0 is an open submanifold of G so that dim G = dim G 0 and 
T(G , e) = T(G 0 , e). Also by Theorem 3.22, G/G 0 is discrete. 

Remarks (1) If G is a connected Lie group and H a proper Lie sub¬ 
group, then dim H < dim G for otherwise H contains an open nucleus of G 
which generates G; that is, G = H. 

(2) We shall show later that if G is a Lie group and if H is a closed sub¬ 
group of G, then H is a Lie subgroup of G. Thus the previously discussed 
subgroups 0(n ), SL(ri), and Sp(n) are all closed Lie subgroups of GL(n , R ). 

(3) We shall consider later normal Lie subgroups when we discuss homo- 
morphisms. 



CHAPTER 5 


THE LIE ALGEBRA OF A LIE GROUP 


We have seen from previous examples that the tangent space T(G, e) of a 
Lie group G can be used to give local information about G. In this chapter we 
formalize this situation by introducing the set of G-invariant vector fields 
-Sf(Cr) and seeing that it is a vector space which is isomorphic to T(G, e). Also 
t£{G) is a Lie algebra over R and induces a Lie algebra structure on T(G, e). 
Using this we define the exponential map exp : Z£(G) -* G in terms of homo- 
morphisms of R into G. The exponential map is a local diffeomorphism 
exp : U 0 -> U e of a suitable neighborhood U 0 of 0 in JS?(G) onto a neighbor¬ 
hood U e of e in G. Using the inverse function log : U e -> U 0 we define canoni¬ 
cal coordinates ( U e , log) at e in G. Thus by the action of L(a): G -> G : x-> ax 
we obtain coordinates at any point ae G. 

The exponential map is used to obtain a local representation of the multi¬ 
plication in G analogous to the results of Section 1.6. Thus for X and Y 
sufficiently near 0 in S£{G) we can write exp X exp Y = exp F(X, Y) where 
F : SF(G) x G£(G) -► < S?(G) is analytic at (0, 0) e Jg ?{G) x JF(G). We show that 
the terms F k (0,0)(X, Y) m of the Taylor’s series for F are in the subalgebra of 
■Sf(G) generated by X and Y. We briefly discuss the actual formula for F(X, Y ) 
which is known as the Campbell-Hausdorff formula. Finally we show that a 
continuous homomorphism of Lie groups is analytic. This yields the fact that 
the analytic structure of a Lie group is uniquely determined by its topology. 
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1. The Lie Algebra 


We now introduce the Lie algebra of a Lie group G in terms of invariant 
vector fields. Thus the Lie algebra will be determined by the tangent space 
T(G, e) and the action of G determines the values of the vector fields at any 
other point in G. 

Definition 5.1 An analytic vector field X e D(G) defined on a Lie group 
G is called invariant if for all a e G 

[(TL(a))(e)]X(e) = X(a). 

Thus as in Section 2.7 we have that since {TL(a))(e): T(G, e) -> T(G, a), then 
the value [(TL(a))(e)]X(e) actually equals X(a). 

Next we note that if X is invariant, then X is L(a)-invariant for all a e G; 
that is, X is actually G-invariant or left invariant according to Section 2.7. For 
let peG, then 

X(L(a)p) = X(ap) = [(TL(ap))(e)]X(e) 

= [T(L(a) o L(p))(e)]X(e) 

= [TL(a)(p)] ■ (TL(p)(e))(X(e)) 

= TL(a)(p ) • X(p) 

which gives the result. 


Proposition 5.2 Let G be a Lie group, let X e T(G , e ), and let 


X:G^T(G):p^X(p), 


where T(G) is the tangent bundle of G with projection map n and X(p) is given 
by 


(Xf)(p) = X(f o L(p)) 


where/is any real-valued analytic function on G. Then X is a G-invariant ana¬ 
lytic vector field on G such that X(e) = X. Furthermore X is the unique 
G-invariant vector field on G such that X(e) = X. Thus any G-invariant 
vector field is of the form X. 


Proof Letting TL(p) = TL(p)(e) we first note that ( Xf){p ) = ( TL(p)X)(f) 
so that X(p) e T(G, p) and therefore (n ° X)(p) = p. Thus X is a vector field 
on G. Since X(p) = TL(p)X we have X(e) = X and the above computations 
show X is G-invariant. For the uniqueness we use Proposition 2.34 with 
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/ = L(a) for any aeGor directly as follows. Let Z be a G-invariant vector 
field with Z(e) = X Then Z(p) = TL(p)(e)Z(e) = TL(p)(e)X = X(p). Finally 
we shall show X is analytic and derive another formula for it. Thus let 
a: I-+G : t-> a(t) be an analytic curve on an interval / containing 0 e R so 
that a(0) = X[— T(e)] and a(0) = e. Then analogous to the results in Section 
2.7 we use the results on curves in Section 2.5 to obtain 

(Xf)(p) = X(fo L(p)) 

= d/dt(0)(fo L(p) o a) - djdt[f (jpa(f))] f _ 0 (*) 

where pot(t) is the analytic product in G. Thus since/, a, and the multiplication 
in G are analytic we have Xf is an analytic function; that is, X is analytic. 

Let J*?(G) denote the set of G-invariant vector fields on G. Then from the 
above result we see that £?(G) consists of all vectors of the form X for 
X e T(G, e ). From X(p) = TL(p)(e)X and TL(p)(e) being injective we obtain 
the following. 

Corollary 5.3 The map (p : ££(G) -► T(G , e) : X -► X is a vector space 
isomorphism. In particular, the dimension of $£(G) over R equals the dimen¬ 
sion of G and is finite. 

Corollary 5.4 <£{G) is a Lie algebra relative to the bracket operation 

[£, Y] = X Y - YX. 

Proof This follows from Proposition 2.33. 

Definition 5.5 (a) The Lie algebra of a Lie group G is the Lie algebra 
S£(G) of G-invariant vector fields on G. 

(b) The Lie algebra g with product [ ] g is homomorphic to the Lie algebra 
h with product [ ] h if there is a vector space homomorphism (p : g h such that 
<t>[XY] g = [cpX(pY] h for all X, Y e g. If (p is a vector space isomorphism, then 
g and h are isomorphic Lie algebras. 

By means of Corollary 5.4 we can make T(G, e) into a Lie algebra as 
follows. Let X, Y e T(G , e ) and let X, Y e 3?(G) as above. Then define the 
product [AT] = [X , Y](e) which is in T(G, e) and makes T(G , e ) into a Lie 
algebra. This yields the following. 

Corollary 5.6 The map <p : J£?(G) T(G , e) : X -► X is a Lie algebra 

isomorphism. 

Frequently the Lie algebra T(G , e ) is also called the “Lie algebra of G.” 
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Example (1) Let G = GL(V). Then from Corollary 5.6 we have the map 
4>: ££(G) -► T(G, I) is a Lie algebra isomorphism using the product [AT] = 
[X , Y](l) in T(G , /). However, we also have the Lie algebra gl(V) attached to G 
and we now show that &{G) is isomorphic to gl(V) as Lie algebras. Recall 
from example (3), Section 2.5 that for each A e gl(V) we defined an element 
A g T(G , /) by 

(Ah) = [ Dh(I)]A , 

h analytic at /. The map gl(V) T(G , I ): A -> A is a vector space isomorphism. 
Thus we obtain a vector field A in &(G) and consequently a vector space 
isomorphism gl(V) -► if(G) : A -+A. We now show this is a Lie algebra 

isomorphism; that is, [%> B] = [A, B\ Usually T(G, I) and gl(V) are considered 
the same and the overbar is omitted as done before but we shall not do this 
now. Let p e G, and A, B g gl(V). Then using L(p)A = pA, the product in 
End(F), we have for / analytic on G 

(% B )(f)(p) = 3(S(/))(p) 

= A(S(f) ° L(p)), definition of X 

= o L(p))(/)]^4, definition of A 

= l D ( B (f))(p) ° D(L(p))(I)]A, chain rule 
= [D(B(f))(p)](pA), L(p) linear 

= lim 1 [{Bf)(p + tpA) - (S/)(p)] 

t-> 0 t 

= lim 1 [B(fo L(P + tpA )) - B(fo L(P))] 

t^O t 

= lim \{[D{fo L(p + tpA))(I)]B - [D(fo L{p))(l)]B } 
f-* 0 t 

= lim ~{[Df(p + tpA)](pB + tpAB) 
t-*o t 

— [Df(p)](pB)}’ chain rule 

= lim \ [Df(p + tpA)(pB) - Df(p)(pB)] 


+ lim - [Df(p 4- tpA)](tpAB) 
r-> o t 

= D 2 f(p)(pB, pA) + Df{p)(pAB). 
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Interchanging A and B, subtracting the equations, and using the fact that 
D 2 f(p) is symmetric we obtain 

(XB - BA)(f)(p) = Df(p)(pAB - pBA) 

= Df(p)(p[A, 5]) 

= [D(f° L(P))(I)][A, B] 

= [g(/oL(p» 

= {[ATB]){f)(p) 

which proves the result. 

Exercises (1) Let G be a Lie group with group multiplication p: 
G x G -*• G and consider the tangent map ( Tp)(a , b) : T(G, a)x T(G, b ) -> T(G, 
p(a, b)) as in exercise (3), Section 2.7. From that exercise we obtain vector 
fields given by l(p, X)(a) = [(Tp)(a, e)](0, X) and r(p, X)(a) = [(Tp)(e, a)](X, 0) 
for X e T (G, e) and 0 e T(G, a). 

(i) Are the vector fields l(p, X) or r{p, X) left invariant or right invariant 
under the action of G; that is, invariant under the set of functions L(G) or 
i?(G)? 

(ii) If they are invariant, how do they compare with the vector field X 
given in this section? 

(2) Let G and H be Lie groups with Lie algebras g and h. Then as in 
Section 4.1, the product group G x H is a Lie group. Show g x h with the 
pointwise operations is the Lie algebra of G x H ; that is, use the product 
[(X u Y i ){X 1 , Y 2 )] = ([X l X 2 ), [7,7,]). 

(3) Let 'A(G) be the set of vector fields which are /^(G)-invariant. Show 
91(G) is a Lie algebra. How is it related to JSf(G) ? [Possibly consider R(a) ° i = 
i o L(a~ l ) and [ri(a -1 )]X(a -1 )]. 

(4) Show the vector field X e -S?(G) depends analytically on the para¬ 
meter X e T(G, e ) (see Section 2.8). 


2. The Exponential Map 


In this section we generalize the map exp : gl(V)^>GL(V) to the map 
exp : 9'(G) —> G which allows us to coordinatize G so that the multiplicative 
properties of G are nicely translated into properties of the Lie algebra if(G). 
First let us recall the function exp : gl(V) -► GL(V) as discussed in Chapter 1. 
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We saw that there exist neighborhoods U 0 of 0 in gl(V) and U, of/in GL(V) 
so that exp : U 0 -* U, is an analytic diffeomorphism. Also for any X e gl( V) 
and s, t, s + t near enough 0 e R we have exp sX, exp tX defined, and 
exp(s + t)X = exp sX • exp tX in GUV). Thus identifying X with J? in 
if(GL(F)) we have the fact that <f>(t) = exp tX is a local one-parameter group 
on GL(V) determined by X (see Section 2.8). Furthermore we saw that </> is 
actually defined on all of R; that is, <p :R->GL(V ) is an analytic homo¬ 
morphism of Lie groups. 

Now for an arbitrary Lie group G we would like to start with an arbitrary 
G-invariant vector field X e Sf(G), form the local one-parameter group </>(;) 
determined by X, and then extend <f>(t) to a global one-parameter group 

R->G. Using this we define exp X = $(1) which is consistent with the 
results for GL{V). Thus in this section we shall first consider the extension 
problem so that the exponential map can be defined as described. It should be 
noted that there are many other possible approaches to this definition depend¬ 
ing on the properties one wants to assume [Chevalley, 1946; Helgason, 1962; 
Loos, 1969]. 

Lemma 5.7 Let V = R m be the w-dimensional vector space with the 
usual Euclidean topology and let B be an open ball with center 0 e V. Let G be 
a topological group and let <f> : B -*■ G be a continuous local homomorphism 
of the additive group structure of V ; that is, if x, y,x + y e B, then <j>(x + y) = 
<fi(x)(j){y) in G. Then there exists a unique continuous homomorphism/: V -*G 
such that f\B = <t>. 

Proof Let x e V and let p be a positive integer such that x/n e B for all 
n> p. Then we set/(x) = (f>(x/p) p . We now show/is well defined; that is,/is 
a function. Thus let q be a positive integer such that xjn e B for all n > q. Then 

(p{xlpq) pq = [<j>(y/q)‘ l ] p , where y - x/p e B 

= [<i>{ylq) ■ ■ ■ <Kylq)Y 
= Wy/g + ■■• + y/q)] p = <P(x/p) p , 

so that interchanging p and q we obtain <j>(x/p) p = 4*(x/q )*. Thus the definition 
of/is independent of the choice of p so that/is a function. Next, for x, y e V, 
let p be such that x/p, y/p, and (x 4- y)/pe B. Then since 4 > ( x /p)4 > (y/p) = 
+ y)/p) = <P((y + x)!p) = 4>(y/p)4>(x/p) we obtain 

f(x + y)= [4>((x + y)/p)] p 

= [4>(x/p)4>(y/p)Y 

= 4>(x/p) p 4>(y/p) p =f(x)f(y). 

Thus /: V -+G is a homomorphism. 
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We now show/| B = <f>. Thus let x e B with p as above so that x/p e B and 
x = x/p + ■ ■ ■ + x/p in B. Then using (f> as a local homomorphism, we obtain 

fix) = <Kx/pY 

= (j){xlp) ■ ■ ■ 4>(xlp) 

= <j p(x/p + • • ■ x/p) = 4>{x). 

Finally/is unique, for if g : V-> G is another continuous homomorphism 
with g\B = 4>, then if a e V and p such that xjn e B for all n>p, we have 

g(x) = g(p(x/p)) = g(x/p) p = 4>(x/p) p -fix). 

Corollary 5.8 An analytic local homomorphism a of an interval I about 
0 g R into a Lie group G can be extended uniquely to an analytic homomorph¬ 
ism / of R into G. 

Proof Just note that I contains an open interval with center 0 and that 
the above definition of f is given in terms of analytic operations in R and G. 

Theorem 5.9 Let G be a Lie group, let X e T{G, e), and let X e =Sf(G) be 
the corresponding G-invariant vector field. Then there exists exactly one 
analytic homomorphism f : R — ► G such that /(0) = X . Thus f is the maximal 
integral curve of X through e e G; that is, f(t) = X{f(t)) for all t e R. 

Proof Let / = (-e, e) and a : I -* G be the integral curve of X such that 
a(0) = e as discussed in Theorem 2.37. We shall now show that a is an analytic 
local homomorphism. Let J be a suitable subinterval of I containing 0 e R 
such that for t fixed in I the analytic maps 

u : J -► G : 5 -> ait + s) and v:J-+G:s-> a(t)oe(s) 

are defined. Then u(s) = [a ° t(()](.v), where r(l) :R-*R:x-*t + x and r'(s) = 
[L(a(0) ° a](V). Using the results on curves we have 

u(s) = [T(a. o T(t))(d/du)](s) 

= (7Y)(t(00)) ° [{Tx(t))(s)](dldu) 

= (T'i)(t + s)(d/du) 

= <* it + s) = X(a(t + i)) = X(«(i)), 

where we use the chain rule for the second equality, [7r(t)](i) is the identity 
for the third equality, and a is an integral curve for the fifth equality. 
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Next we have 


m = [T(L(y.(,)) o a )(d/du)](s) 

= [rL(a(/))](a(j)) o (Ty)(s)(dldu) 

= [(rL(«(0)(«(5))](a( 5 )) 

= [rL(a(0)(a(*))]X(a(s)) 

= [TL(y(t))X](*( S )) 

= [X o L(a(0)](a(.v)) 

= X(y(t)y.(s)) = J?(»(s)), 

where we use the chain rule for the second equality and X is G-invariant for 
the sixth equality. Thus we see that u(s) and v(s) are solutions to the dif¬ 
ferential equation z(s) = X(z(sj) and satisfy u( 0) = v(0) = a(f). Thus by 
Theorem 2.37 there is a neighborhood J' of 0 e R where u(s) = v(s) ; that is, 
there is a suitable neighborhood N of 0 e R so that a : N -*• G is an analytic 
local homomorphism. The theorem now follows from Corollary 5.8 and the 
following which shows f(t) = X( f(t)) for all t e R. Thus let t = t x + t 2 e R, 
where t x , t 2 e N which is a neighborhood of 0 e R such that / is an integral 
curve; that is,/= a. In particular, f(t 2 ) = X(f(t 2 )) and/(q + t 2 ) = f(t 1 )f(t 2 ). 
Now since X is defined on G, we have from various definitions and the G- 
invariance of X 


X(f(t)) = X(f( k + t 2 )) 

= X(L( /(q))/(t 2 )) 

= [TL{f(tj)]f{t 2 )X(f{t 2 )) 

= 2))Kh) 

= [m/( fl ))](/(q)) • m)(d/du)](t 2 ) 

= [r(L(/(q)) o/)W /t / M )](q) 

= [T(f° <h))(d/du)](t 2 ) 

= (77)(q + t 2 ) o (Tx( tl ))(t 2 )(d/du) 

= (Tf)(t)(d/du) = f(t). 

Noting that any t e R can be written as t = = i where t t e N, we use/as 

a homomorphism and induction on n to obtain the result. Also because the 
domain of / equals R , / is the unique maximal integral curve through e e G. 

Exercise (1) How can the results of Theorem 2.37(b) be formulated in 
the present context to give the above result directly? 
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Definition 5.10 Let G be a Lie group and let X e ££(G) be the vector 
field corresponding to X e T(G , e ). Let /* denote the unique analytic homo¬ 
morphism of R into G of Theorem 5.9 such that /*(0 — Then we 

define the exponential map exp or exp G to be the map with domain 3?{G) 
given by 

exp:^(G)^G:X^/*(l). 

We first note that for all t e R , 

exp tX =/,x( 1) =/x(0- (*) 

The first equality is the definition. To see the second let t be fixed (but 
arbitrary) and let 

g : R->G:s -+fx(ts). 

Then g is an analytic homomorphism and if t*(0 : R -► R : x -► tx we have 
0 ( 5 ) = [f% o t*( 0]C$). Consequently 

m = tn/x ° T*(t))(d/du)m 

= (Tfx)(x*(t)(0)) o [Tx*(t)](0)(d/du) 

= C Tfx)(PMd/du )) = tX 

using r*(t) as a linear transformation of for the third equality. However, 
the homomorphism f t % : R -► G is also such that /rx(0) = tA" so that by the 
uniqueness part of Theorem 5.9 we obtain f t x(s) —fx(st) which gives the 
result. 

Remark (1) From formula (*) we see that the curve R-+ G : t -*• exp tX 
can be characterized as the curve f\R-+G such that: 

(i) / is an analytic homomorphism, 

(ii) /(0 = £(/(0) all t e R. 

We also have for all x, t e R and X e £?(G) that 

exp(x + t)X = exp sX • exp fX, [exp tX]~~ l = exp ( — tX) 
since/* is a homomorphism. 

Theorem 5.11 Let G be a Lie group with Lie algebra ^f(G). Then the 
exponential map exp : <£?((/)-► G is analytic and T(exp)(0): T(2?(G), 0) -> 
T(G , e) is a nonsingular linear transformation. 

Proof Since the vector field X depends analytically on the parameter 
X e T(G, e) we have from Theorem 2.39 that there exist an open interval 
( — e, e) of 0 e R, an open convex neighborhood U of 0 in T(G, e), and an 
analytic mapping u : ( — e, e) x U -> G such that for each t e (-e, e) and each 
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X e U we have u(t, X) Now let a e ( — e, e) be a fixed number such that 

0 < a < 1 and let U' = aU c U. Then the map U' -* G : X -> u(a, a~ l X) is a 
well-defined analytic function since ae(—8 , e) and a~ l Xea~ 1 U l = U. 
However, we have 

u(a, a~ l X) =f a -ix(a) =f i ( a ~ 1 a) = exp X. 

Thus using the isomorphism $ : &(G) -* T(G, e): X -> X of Corollary 5.3 we 
see exp ° <p~ 1 is analytic on U’ since 4> is a linear transformation which is 
analytic; that is, exp o 0 _1 is analytic at 0 e T(G, e). 

Next we shall show exp ° <f>~ x is analytic on all of T(G, e), for let X be any 
element in T(G, e). Then there is a neighborhood D of X and an integer p > 0 
so that 1/pD <= {/'. However, since [exp(l/pA')] p = exp X and exp is analytic 
on V, we see that exp X is the (analytic) product of analytic functions so that 
exp is analytic on D. Finally, T(G, e ) is isomorphic to i?(G) by the analytic 
linear transformation Thus exp = (exp ° <£ -1 ) c </> is analytic on SC{G). 

Next, to show 7'(exp)(0) is nonsingular we recall from Section 2.4 that the 
tangent space of the finite-dimensional vector space &(G) at the point 
0 e -£?(G) equals the vector space of all directional derivatives evaluated at 0; 
that is, T(jS?(G), 0) = {£>*(0): X e: £?(G)} and : T(if(G), 0) : D$( 0) - 

X is a vector space isomorphism. Now let />(?) = exp tX and let k be any 
function which is analytic at eeG. Then 

[(T(exp)(0))(Z) j f(0)](A:) = (D~ x (0))(k o exp) 

= lim X - [k - k o/x(0)] 

t~*o t 

= J t [k{e */x(0)]f=o = (Xk)(e) = X(k) 9 

where the fourth equality uses formula (*) in Section 5.1 and the fifth equality 
uses X(e) = X. Thus 

r(exp)(0) • 0) = X. 

However, the isomorphisms </) and \j/ above give <$>% = X and ^(Z)^(0)) — X 
so that T(exp)(0) = <f> o \jj which is nonsingular. 

Notation In the above proof we kept track of the various vector spaces 
by the isomorphisms </> and if/. However, to simplify notation, these vector 
spaces (and Lie algebras) are usually identified and the isomorphisms are 
ignored—it all depends on which space one takes as the definition of the Lie 
algebra. We shall identify as much as possible and use the Lie algebra which is 
most convenient. Thus we shall write JS?(G) = T(G, e) = g or equal to any 
other useful isomorphic characterization; for example, J£?(GL(F)) = 
T(GL(V ), /) = gl(V). We can now consider the exponential map as defined on 
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T(G, e) = g and note that, using <^ = ^=7, Theorem 5.11 is frequently 
stated as follows. 

Corollary 5.12 Let G be a Lie group with Lie algebra g. Then the expo¬ 
nential map exp : g -» G is analytic and T(exp)(0) = 7, the identity in End(g). 

Example (1) For the Lie group G - GL(V ) we have identified its Lie 
algebra i?(G) with gl(V). Thus for X e gl(V) we have/(0 = exp tX as defined 
in this section is characterized by (i) and (ii) of remark (1) preceding Theorem 
5.11. However, we have previously seen that the matrix function e ,x = 
£ fX n /n\ satisfies the same conditions. Thus by the uniqueness we obtain the 
consistent result exp tX = e ,x . 

Proposition 5.13 Let G be a Lie group with Lie algebra g. Then there 
exist a bounded open connected neighborhood U 0 of 0 eg and an open 
neighborhood U e of e e G such that exp : Uq —► U e \ X —► exp X is an analytic 
diffeomorphism. 

Proof We use the inverse function theorem as stated in Section 2.5 for 
analytic functions. 

Definition 5.14 Let log :U e ->U 0 denote the analytic inverse of 
exp : U 0 -* U e given above and let g : g -*• R m be a vector space isomorphism. 
Then D 0 = g{U 0 ) is open in R m and the pair (U e , r\ °log) is a chart at e e G 
called a canonical or normal chart at e e G and U e is called a canonical or 
normal neighborhood of e. 

The isomorphism t] is frequently omitted and we just consider the pair 
(U e , log) as a canonical chart. Thus if X u X m is a basis of g, then D 0 = 
{Oq,..., xj e R m : £ jq X t e U 0 } and the explicit coordinate map is 

m 

log : U e -> D 0 : exp( £ x t X t ) -> (x„ ..., x m ). 

i= 1 

As discussed for the Lie group GL(V), we see in general that any point a in a 
Lie group G has an (analytic) chart given by (aU e , log « L(a) 1 ). 

Remark (2) The image exp : g -> G is contained in the connected com¬ 
ponent of the identity G 0 . However, exp need not be surjective, for let 
G = 57.(2, R) which is the Lie group of 2 x 2 matrices of determinant 1. We 
shall now briefly show there exists an element A not of the form exp X = e x . 
Thus let 
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where r < — 1. Then if A = e x , the characteristic roots of A are of the form e a 
and e b where a, b are characteristic roots of X. Suppose r — e a and l/r = e b . 
Then a = -b + 2tori. However, since r < 0, a is actually complex and there¬ 
fore its conjugate is also a characteristic root; that is, b — a. This gives a as 
pure imaginary. Thus we obtain a contradiction 1 = \e a \ = \r\ > 1, by the 
assumption r < — 1. This contradicts A = e x . 


3. Exponential Formulas 

In this section we use the exponential function to develop a Taylor’s 
series expansion for a real-valued analytic function defined on a Lie group 
(see Section 2.8). Then using this we obtain the first few terms in the expansion 
of the analytic function F given by 

exp X exp Y — exp F(X, T) 

for X and Y in a suitable neighborhood of 0 in T(G , e) = g. Thus for 6 = (0, 0) 
in g x g we obtain the local approximation for the multiplication in G by 

F(X, Y) = X + Y +1 [XY] + 1 F 3 (9)(X, Y) (3) + ■ • • 

analogous to the results in Section 1.6. We shall show that the higher-order 
terms F k (0)(X, Y) ik) are all contained in the subalgebra of g generated by X 
and Y. Finally we discuss the Campbell-Hausdorff formula for F(X, Y). 

Proposition 5.15 Let G be a Lie group, let / be a real-valued function 
analytic at p e G, and let X e J£?(G) with X e T(G , e). Then there exists e > 0 
such that, for \t\ < e, 

f(p exp tX) = X — (X n f)(p). 

n = 0 n\ 

We shall refer to this formula as a Taylor’s series expansion for f 

Proof From formula (*) in Proposition 5.2, we have for a(/) = exp tX 
that 

(Xf)(p) = [(d/dt)f(p exp tX)] t= o 
which proves the formula 

(X n f)(p) = [(d/dt) n f(p exp tX)l= 0 
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for n = 1, and we now continue by induction. Thus 
(X" + x f){p) = [X"(Xf)](p) 

= [(d/dtT(Xf)(pcxptX)]^ 0 
= [(d/dt) n (d/ds)f(p exp tX exp sX)] s =o» »=o 
= [(d/du)"(d/du)f(p exp uX)] u = 0 
= [(d/du) n+l f(p exp uX)U o 

where the fourth equality uses exp tX exp sX = exp(f + s)X and u = t + s. 

Now since / is analytic at p, then for some e > 0 and for 1 1 1 < e we have 
the fact that f(p exp tX) is an analytic function of t at 0 e R. Thus we can 
write the power series 

00 

f(p exp tX) = X a" t n jn\ 

n = 0 

for |*| < e, where the a n equal the «th derivative of f(p exp tX) at t = 0; that 
is, a n = ( X tt f)(p), which proves the Taylor’s series expansion. 

Theorem 5.16 Let G be a Lie group and let X, Y e T(G , e). Then there 
exists s > 0 such that for \t\ < e: 

(a) exp tX exp t Y = exp (tX + tY + \t 2 [XY\ + o(t 3 )); 

(b) exp tX exp t Y exp( -tX) = exp(* Y + t 2 [XY] + o(t 3 )) ; 

(c) exp(-*X)exp(-*Y) exp tX exp tY = exp (t 2 [XY] + o(t 3 )); 

where in each case o(t 3 ) is a vector in T(G , e) such that for \ t\ < e, (1 jt 3 )o{t 3 ) 
is bounded and analytic. 

Proof (a) Let /be analytic at p e G. Then using the formula 
(XY)(P) = Mdtffip exp tX )] t= o 
twice we obtain for a function g analytic at e e G 

(X n Y m g)(e) = [(d/dt) n (dlds) m g(exp tJYexp ^Y)] s = 0 , • 

Therefore we obtain the Taylor’s series expansion 

t n s m 

g(exp tX exp sY) = Y. [X"Y m g](e) 

m,n> 0 ft ! Wl! 

for s 9 t sufficiently near 0 e R. Thus for s = t we obtain 

jin + n 

g(exp tX exp tY) = £ —- [X”Y m g](e), 

m, n^O Wl,ft • 


(l) 
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where the coefficient of t 1 is {Xg){e) + ( Yg)(e ) and the coefficient of t 2 is 
i(X 2 g)(e) + ix?g](e) + i(Y 2 g)(e). 

However, by Proposition 5.13 and the analyticity of the multiplication in G 
exp t X exp t Y = exp F(t ) 

where F: I -* T(G , e) is an analytic map of an open interval / c which 
contains Oei?. Since e = exp 0 = exp 0 exp 0, we see that F( 0) = 0. Thus for 
t el, F has the Taylor’s series expansion 

F(t) = tF i + t 2 F 2 + o(t 3 ) 

for fixed F u F 2 e T(G , e). 

Since we are working near e e G, we can assume the operations take place 
in a normal neighborhood U e , and we now have for # analytic at e e G that 

#(exp t X exp t Y) — g[tx p F(t)] 

= g[^V>{tFi + t 2 F 2 + o(t 3 ))] 

= gfcxpitFi + t 2 F 2 )] + o'(t 3 ) 

= t~. [( tF , + t 2 F 2 yg](e ) + o'(t 3 ), (2) 

where the last equality uses Taylor’s formula for tF t + t 2 F 2 and where o\t z ) 
denotes a real number such that for some e > 0, (1 /t 3 )o'(t 3 ) is analytic and 
bounded for \ t\ < e. From the expression in the last equality we see that the 
coefficient of t 1 is [F t g](e) and the coefficient of t 2 is [F z g](e) + \[F 1 2 g]{e). 

Thus comparing the coefficients of t and t 2 in formulas (1) and (2) we 
obtain 

(F,g)(e) = (Xg)(e) + (Yg)(e) and (F 2 g)(e) = {^[X, Y]g)(e). 

Since g is an arbitrary analytic function at e e G we obtain F t = X + Y and 
F 2 = i[XY] which shows 

exp tX exp t Y — exp F(t) = exp (tX + t Y + \t 2 [XY] + o(t 3 )). 

For t small enough, we use (a) to compute (b) as follows. 

[exp tX exp t Y] exp( — tX) = [exp (tX + t Y + \t 2 [XY] + tf(t 3 ))] exp(— tX) 

= exp 5(0, 

where from (a) 

S(t) = (tX+tY+it 2 [XY]) + (-tX)+}[tX+tY+i1*lXYl -tX]+ o(t 3 ) 
= tY + t 2 [XY] + o(t 3 ) 

which proves (b). 

(c) Part (c) is proven similarly. 
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Remarks (1) With the conditions and formulas of Theorem 5.16 we 
can show 

(d) exp t(X 4- Y) = exp tX * exp t Y • exp o(t 2 ); 

(e) exp (t 2 [ XY]) = exp( — tX) exp( — t Y) exp tX exp t Y • exp o(t 3 ). 

Also we have the following formulas which we shall use later. For any t e R, 

(f) exp t(X 4- Y) = lim^texp (t/n)X exp (t/n) Y] n ; 

(g) exp(f 2 [AT]) = liin„_ 00 {exp[ - (t/n)X] exp [-(t/n) Y] 

Qxp(t/n)X Qxp(t/n) Y} n \ 

For example, to see (f), let t be fixed in R and let n be sufficiently large. Then 
from Theorem 5.16, 

CXP l n Xi£XPl n Y = CXP [n iX+Y) + h [XY] + ’(j?). 

and consequently 

exp - X exp - Y = exp \t(X 4- Y) 4- [XY] 4- 

n n J L 2» \n ! J 

which yields the result. 

(2) Let G be a Lie group with Lie algebra g. Then the above formulas 
are frequently expressed as follows. 

(a') There exists a neighborhood U of 0 in g such that for all X, Y e U 
we have exp X exp Y = exp(X 4- Y 4- i[XY] 4- e(X 9 7)) where 

lim s(X, Y)I\\X\\ 2 ||r|| 2 = 0. 

x , r-o 

(a") For any X, Yeg and for 5, t sufficiently near 0 e R, exp sX exp t Y 
= exp^X 4- t Y + \st[XY] 4- e(^, 0) where lim s> f _ 0 e(.s, t)/st = 0. 

(3) Now let g : G x G: (x, y) g(x, y) = xy denote the analytic multi¬ 
plication on G . Then 

(Tg)(e, e): T(G , e) x T(G, e) -+ T(G, e ) :(X, Y)^X+ 7, 

for let t be near 0 e R. Let ^(exp tX, exp tY) = exp F(t) where F(i) = tX 4- 
t Y 4- o(t 2 ). Then using the chain rule, r(exp)(0) = /, and d/du(0)(F) = X + Y 
we obtain the formula. 

Proposition 5.17 Let G be a Lie group and let g u ..., g k be subspaces of 
the Lie algebra g such that g is the subspace direct sum + • • • + g k . Then 
the analytic map 

<t> ■ gi + • • 4 + g k -* G : X 1 H - + X k -> (exp X k ) ■ ■ ■ (exp X k ) 
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is such that Tcj)( 0) = I. Thus there exist bounded open connected neighbor¬ 
hoods of Ui of 0 in g t and U e of e in G such that the map 

: U x + * • * + U k -> U e : X x + ■ ■ * + X k - (exp (exp X k ) 

is an analytic diffeomorphism. 

Proof Using induction and the inverse function theorem it suffices to 
show T(j)(fi) = / for the case g — g t + g 2 . Thus let exp; = exp G | g t . Then since 
g t czg we see r(exp f )(0) is the identity map on g { . Therefore identifying 
9\ + 9 1 with 9 1 x g 2 we have for X t e g t that X x + X 2 = (X u X 2 ) and there¬ 
fore, with g the multiplication in G , 

(T&mx, + X 2 ) = [(J0X0, 0)K*1, * 2 ) 

= [roi o expj x exp 2 )(0, 0)](A' 1 A r 2 ) 

- (?»(*, e)((J ex Pl )(0)^, (rexp 2 )(0)Af 2 ) 

using (j) = go (exp A x exp 2 ), the chain rule, and remark (3) above. This proves 
the result. 

Remarks (4) Using this result we can introduce coordinates as follows. 
Let X u .X m be a basis of g so that X u ..., X r(l) is a basis of g u X ril)+l9 
..., A" r(2) is a basis of g 2 , etc., for g = 9i + ' " + g k as above. Let U e = 
<t>(Ui + ••• + U fc ) be as in Proposition 5.17 so that elements of U e are of 
the form 

HD r(fc) 

<t>(L x i X t) = ( ex P £ *. AT) ■ • • (exp £ x i A' i ). 

Let £> 0 = {(Xi, .... x m ) e /? m : £x ; X t e £/, + • • • + U k c. g}. Then the map 
U e ^D 0 : 0(Xx f AT,) *(Xj .x m ) 

yields a coordinate system called a canonical or normal coordinate system of 
the second kind. 

(5) From formula (c) of Theorem 5.16 we see that [XY] is the tangent 
vector to the “commutator curve” 

s -► exp(— s 1/2 X) exp (~s i/2 Y) exp(.y 1/2 20 exp(s 1/2 T), s >0 

at s — 0. Thus frequently the operation [XY] in g is referred to as the “com¬ 
mutator operation in the Lie algebra.” In the work of Pontryagin [1946] this 
is actually used as the definition of a bilinear multiplication on T(G, e) which 
makes it into a Lie algebra. 

(6) Let U 0 be the open neighborhood of 0 e g as given in Theorem 5.13. 
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Then since exp : U 0 U e is a diffeomorphism we can define a composition 
F: U 0 x U 0 ^g by 

F(X 9 7) = log(exp X exp Y ) 

for X , Y e Uq provided exp X exp YeU e . Thus £/ 0 with this composition 
becomes a local Lie group with 0 as the identity and — X as the inverse of X, 
Now for any X, Yeg and t sufficiently near 0 e R we have from formula (a), 

F(tX, tY) = tX+tY+y 2 [XY] + o(t% 

where the error term o(t 3 ) is also a function of A", Yeg. Actually, o(t 3 ) can 
be expressed quite nicely by a series which is contained in the Lie algebra 
generated by X and 7; that is, looking ahead to Chapter 6, we define a Lie 
subalgebra h of the Lie algebra g to be a subvector space of g such that for 
all X, Yeh we have [XY] e h. In particular if L(X , 7) denotes the subalgebra 
of g generated by X and 7; that is, 

L(X, Y) = : h is a Lie subalgebra of g and 7, 7 e /*}, 

thenL(7, 7) is spanned as a vector space by X , 7, [XY], [7[77]], [Y[YX]] 9 
etc. Thus we shall show that the above formula has the form 

F(tX,tY)=f^F i (tX 9 tY) 

i= 1 

where each F t (tX 9 tY)eL(X 9 7). This result is known as the “Campbell- 
Hausdorff theorem.” Note that since a subvector space is closed we also 
have F(tX , 1 7) £ L(X , 7). Consequently the multiplication in the neighbor¬ 
hood of U e is completely determined by the Lie algebra. We shall express this 
more accurately later by noting that two Lie groups are locally isomorphic 
if their corresponding Lie algebras are isomorphic. 

Exercises (1) As in the proof of Theorem 5.16 we can write exp tX' 
exp t Y = exp F(t), where 

Fit ) = tF 1 + t 2 F 2 + t 3 F 3 + oft 4 ). 

Show that the third-order term is 

F 3 =M[nXY]+[Y[YX]]}. 

(2) Let G be a Lie group with Lie algebra g = T(G 9 e). 

(i) If X, 7 e g are such that [XY] = 0, then show exp X exp 7 = 
exp(7 4- 7). A Lie algebra is called commutative or Abelian if [XY] — 0 for 
all X, Yeg. 

(ii) Let G be a connected Lie group with Lie algebra g. Then G is 
commutative if and only if g is commutative. 
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We shall now prove that Campbell-Hausdorff theorem as mentioned in 
remark (6); this proof is essentially that given by Eichler [1968]. Following 
his notation we shall call an expression G(X , Y, ..., Z) a Lie polynomial if 
G(X , Y,..., Z) e L(X, Y, ..., Z) which is the Lie subalgebra of # generated 
by Y, Y, ..., Z in g. Thus G(X , Y) is a Lie polynomial if it is a linear com¬ 
bination of X , Y, [IY], etc. An expression G(Y, Y, ...,Z) is homogeneous 
of degree k if for any s e R, G(sX, sY,... 9 sZ) = s k G(X , Y,..., Z). By pre¬ 
vious results of this section we have for X 9 Yeg and s near enough 0 e R 
that 

exp sX • exp sY = exp F(sX 9 s Y), 

where F is analytic at 6 = (0, 0) e g x g and has the series expansion 

o° v k F k (Q\ 

F(sX 9 sY) = I Y)<*\ 

k= 1 k\ 

Thus F\0)(X, Y) = X + Y and F 2 (0)(Z, Y) (2) = [YY] are Lie polynomials 
which are homogeneous of degree 1 and 2, respectively. We can now state 
the result as follows. 

Theorem 5.18 (Campbell-Hausdorff theorem) Let G be a Lie group 
with Lie algebra g. Let X, Y e g and let 5 be sufficiently near 0 in R so that 
the multiplication is represented locally by 

exp sX exp sY — exp F(sX, sY) 

where the analytic function F is given by the above series. Then each 
F k (6)(X 9 Y) (k) is a Lie polynomial in X and Y and is homogeneous of degree k. 

Proof We first note the following facts which we leave as brief exercises. 

(a) For X 9 Yeg and j near enough 0 e R, 

F k (6)(sX + o(s 2 ), sY) w = s k F\6)(X, Y) ,k > + o(s k+1 ). 

[Just use the multilinearity of F k (6) and write (sX + o(s 2 ), sY) = ( sX,sY ) + 
(o(i 2 ), 0).] 

(b) If G(X, Y,...,Z),H(X, Y,...,Z),...,K(X, Y,...,Z)eL(X, Y,...,Z)\ 

that is, Lie polynomials in X, Y, ..., Z, then G(H(X, Y,..., Z). K 

(X, Y, ... Z)) is also a Lie polynomial in Y, Y, ..., Z. 

(c) If G(Y, Y, ..., Z) is a Lie polynomial and G(X, Y, ..., Z) = 
£ k G k (XY, , Z), where the G k (X, Y, ..., Z) are the homogeneous com¬ 
ponents of degree k into which G decomposes, then each G k { Y, Y, ..., Z) 
is a Lie polynomial. 
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We shall now use the notation 

F k (sX,sY) = ^F k (d)(X, Y) ,k) 

and show by induction that each F k (sX , sY) is a Lie polynomial in X and Y. 
For k = 1, 2 this has already been done and we assume for all k < n that each 
F k {sX , s Y) is a Lie polynomial in X and Y. From the associative law 

(exp sX exp s Y) exp sZ = exp sX(ex p s Y exp sZ ) 

we obtain 

00 OO 00 GO 

I Fi( I Fj(sX 9 s n sZ) = X F&sX, £ Fj(s Y, jZ)). (1) 

i= 1 ;=1 i— 1 j= 1 

Expanding the left side of (1) we see [using (a) and Y) = X + Y] that 

the homogeneous term of degree n is 

s n {F n (X, Y) + |[F„_ 1 (Z, Y) Z] + • • ■ + F n (X + Y, Z)}. 

Thus by the induction hypothesis, the only possible term in this expression 
which might not be a Lie polynomial is 

s"{F n (X, Y) + F„(X 4- Y,Z)}. 

Similarly expanding the right side of (1) we see that the only possible homo¬ 
geneous term of degree n which might not be a Lie polynomial is 

s n {F n (X, Y + Z) + F n (Y, Z)}. 

This relation can be expressed as follows: We write U ~ V if U — V is a Lie 
polynomial. Thus we have 

F n {X, Y) + F n (X +Y,Z)~ F n (X, Y+Z) + F n ( Y, Z) (2) 

and using (2) we will show 

F n (X, T) ~ 0; 

that is, F„(X, Y) is a Lie polynomial in X and Y. 

We now compute. First from exp aX exp bX = exp(a + b)X we see 
F(asX, bsX ) = {a + b)sX so that for k > 1, 

F k (aX, bX) = 0 (3) 

and using exp X exp 0 = exp X we also have for k > 1 


F k (X, 0 ) = F k (0, X) = 0 . 


(4) 
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We now omit the s near 0 e R for the following computations and let Z = — Y 
in (2) to obtain 

F n {X, Y) + F n {X + Y, — Y) ~ F„(X, Y - Y) + F n ( Y, - Y) 
so that by (3) and (4), 

F n (X,Y)~-F n (X+ Y, -Y). (5) 

Let X = — Y in (2) to obtain 0 ~ F n ( — Y, Y + Z) + F n ( Y, Z). Thus replacing 
Y by X and Z by 7we obtain 

F n (X,Y)~ -F n (-X,X+ Y). (6) 

This yields 

FIX, Y)~ -F n (-X,X+ Y), using (6) 

~ -{F n (-X +(X+ T)), -{X + Y)), using (5) 

= F„(-(-T), ~(X+ Y)) 

= (- 1)"F„(- Y, X + Y), homogeneity 

~ -(-lfF n {Y,X), using (6). (7) 

Next Let Z = —$Y in (2) to obtain 

F„(X, Y) - F n (X + Y, -17) + F n (X, } Y). (8) 

Similarly let X= -±Y in (2) to obtain F n {\Y, Z) ~ F„(~iY, Y + Z) + 
F n ( Y, Z) so that replacing Y by X and Z by Y we obtain 

F„(X, Y) ~ F„(iX, Y) — F n { — \X, X + T). (9) 

We now use (8) on the two terms on the right of (9) to obtain 
F n {X, Y) ~ -F„(iZ+ Y,-iY) + mx, iY) + F n (iX + Y,-\X- \Y) 
-F n (-iX,iX+iY) 

= (i) n FJX, Y) - (h) n F n (-X, X + Y) — F n (\X + Y, -±T) 

+ F n (iX+ Y, -iX-iY). (10) 

Next 

F n (iX + Y, —\Y) - F n {\X + Y+(-±Y),lY), using (5) 

= -F n (i(X+ Y)AY) 

= -(i)"F„(Z+ Y, Y) (11) 

and 

FJ&X+ Y,-±X-±Y)~ -F n (±X + Y+(-$X-iY),iX+iY) 

=-(i) n F„(Y,X+Y) (12) 
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where we again use (5). Thus, since from (6) we have -F n ( — X, X + Y) ~ 
F n (X , Y ), we substitute (11) and (12) into (10) to obtain 

F n (X, Y) ~ {W~ l F n {X, Y) + (\) n F n {X +Y,Y)~ (M r, AT + Y). (13) 

Thus, using (7) we have from (13) 

[1 _ (#r~ l ]F n (X, Y) ~ (i)"(l + {-\T)F n (X + Y, Y). (14) 

In case n > 1 is odd we obtain the desired result F n (X, Y) ~ 0. In case n is 
even we replace X by X — Y in (5) and use (14) to obtain 

-F n (x, -Y)~F n (x- y, y) 

~ d)"(i + (-mi - ar l r l F„(x, y). (is) 
We apply (15) twice to obtain 

F n (x ; Y)= -(-F n (y,-(-y))) 

- -W + (-i)) rt [i - (irTW, - y) 

~(i) 2w (i + (-imi -ay-'rw, y). 

Since the coefficient on the right in (16) equals l/(2"~ i - l) 2 # 1, we again 
obtain F n (X \ Y) ~ 0. This completes the proof of the Campbell-Hausdorff 
theorem. 


Remarks (6) The above proof is a relatively straightforward computa¬ 
tion not involving much machinery. However, with more development the 
proofs in Jacobson [1962] and Serre [1965] give the following explicit for¬ 
mula (Campbell-Hausdorff formula): For a suitable neighborhood U 0 of 0 
in g we have for X, Y e U 0 that 


F(X, Y)= £ 


(- 1 ) 


71-1 


I 


x(X Pl , Y qi , X p \ Y q \ X Pn , Y q ") 


C(Pt,q» ...,p n ,q„) 


where the second sum runs over the integers Pi,q t > 0 with p t + qi> 1 for 
and 


C(Pl, <h ,•••,/>„,?„)=£ (Pi + <h)Pi ! • • • Pn'-QnU 

i=l 

x{X p \ Y q \ x p ", y«") =[[••• [[[X Pl Y q '\x pi ] Y" 2 ] ■ ■ ■ X Pn ] Y q " 
where we use the notation 


[U r V^=[[-[\S L UU]U]-U]V] -V] 

N___X '-.-' 

r s 


for k > 1 and r(A rl ) = X, t(Y x ) — Y. 
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If we let ad(X ): g -> g : Y -* [XY], then the above expression for t is 
z(X Pl , Y«‘,..., X p ", Y q ") = (ad X) Pl (ad y)«» • • • (ad T)''" -1 y 
if < 7 „ > 1 and 

T(y p ‘, y«\ ...,X P ") = (ad y) p ‘(ad T)" 1 • ■ ■ (ad XY'-'X 

if q„ = 0. From these formulas observe that if q n > 2, or if q n = 0 and p„ > 2, 
then i(X p \ X Pn , Y q ") is 0. Thus possible nonzero terms occur only when 
q„ = 1 or when q n = 0 and p n = 1. 

Exercise (3) Compute the term F\S){X, Y) w / 4! using the Campbell- 
Hausdorff formula. 


4 . Homomorphisms and Analytic Structure 


We use previous results of this chapter to discuss elementary facts on 
homomorphisms from which we obtain the fact that the topology of a Lie 
group uniquely determines its analytic structure. 


Proposition 5.19 Let G and H be Lie groups with Lie algebras g and h, 
respectively, and let f :G->H be a group homomorphism which is an 
analytic map of manifolds; that is, an analytic homomorphism. Then 
77(e) : g -*h is a Lie algebra homomorphism and for Xeg 

/(exp G X) = exp H [(77(e))(X)]. 

Thus for e the identity in H, the accompanying diagram is commutative. 


T(G, e) 

expo 

* 

G 


( Tf)(e ) 


/ 


T(H, e) 

exp h 

H 


Proof For X e g the map 

<j) : R^H: t-* /(exp tX) 

is an analytic homomorphism and for cc(t) = exp tX we see 
4>(0) = [T(fo cc)(d/du)](0) 

= (Tf)(e)d(0) = (Tf)(e)X. 




136 


5. THE LIE ALGEBRA OF A LIE GROUP 


However P: R-+ H : t —> exp ;f t(Tf{e)X ) is an analytic homomorphism and 
$(0) = Tf(e)X. Thus by the uniqueness of Theorem 5.9 we obtain/(exp tX) = 
ex P t(/7((’)T). Next, since Tf(e)\ g -*h is a vector space homomorphism, we 
must show Tf(e)[X T] = [Tf(e)XTf(e) 7] for all X,Yeg. Notice that the two 
maps p : G x G^H: (x,y)^f(x~ 1 y~ 1 xy) and g : G x G ^ H: (x, y) 
/(*ry<>ry(*)/(y) are equal. Thus for t near OeR we apply this to 
x = exp tX, y = exp t Y and use Theorem 5.16(c) together with the formula 
of the preceding paragraph to obtain Tf(e) as a Lie algebra homomorphism. 

Exercises (1) Let /: G —> 77 be an analytic homomorphism of Lie 
groups. Then Tf(e):g-*h is surjective (injective) if and only if Tf(a): 
T(G, a) -> T(H,f(a )) is surjective (injective) for all a e G. Thus/is an immer¬ 
sion or submersion depending only upon the value Tf(p) at the single point 
p = e. 

(2) If in (1) the map/: G-> H is injective, then / is regular; that is, 
Tf(a) is injective for all aeG. (Note Proposition 5.19.) 

(3) If G is a commutative connected Lie group with Lie algebra g, then 
exp . g —* G is surjective. (See exercises in Section 5.3 and regard exp as a 
homomorphism of a commutative groups.) 

Remark (1) From the preceding results we see that an analytic iso¬ 
morphism of Lie groups implies an isomorphism of the corresponding Lie 
algebras. The converse is true locally, and globally if the groups are simply 
connected. Consider the locally isomorphic groups R and T 1 for a counter¬ 
example. It is this converse which allows a classification of certain Lie groups 
by classifying their Lie algebras. 

Proposition 5.20 Let a, /? be two analytic homomorphisms of the Lie 
group G into the Lie group H. If Tbt(e) = T[l(e), then there exists an open 
neighborhood U of e in G on which a and p are equal. Furthermore, if G is 
connected, then a = p on G. 

Proof Let (U e , log) be a canonical chart at eeG and set U = U e . 
Then from Proposition 5.19 we have for a = exp Xe U, that 

a(a) = a(exp(X)) 

= exp[(7a(e))(Z)] 

= exp[(77?(e))(Z)] = p( a ) 

which proves part of the result. The other part follows from the fact that U 
generates G and a, p are homomorphisms. 
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The first fact needed to prove that the topology of a Lie group determines 
is analytic structure is the following result. 

Lemma 5.21 Let G be a Lie group with Lie algebra g, let /: R -> G be a 
homomorphism of the additive group R into the group G, and let / be a 
continuous map of the corresponding topological spaces. Then these exists 
Xeg such that f(t) = exp tX. Thus a continuous homomorphism/: R -► G is 
actually analytic. 

Proof Let ( U , log) be a canonical chart at e e G and let V be a neigh¬ 
borhood of e in G with VV c U. We shall now show that the map V -> U : 
a -► a 2 is injective. For if ae V, then a 2 e U so that log a and log a 2 are de¬ 
fined. Thus cj) : R->G : t -+ exp (t log a) is a homomorphism such that 
a 2 = exp(2 log a). However, since a 2 e U, a 2 = exp(log a 2 ) and since exp| U 
is injective, 2 log a = log a 2 . Thus a = exp(log a) = exp(^ log a 2 ) so that 
a 2 e U uniquely determines a e V. Thus “ square roots ” exist and are unique. 

Next since the given homomorphism /: R -> G is continuous, there exists 
e > 0 such that f(t) e V if |t| < e. Now we can assume £ = 1 otherwise we 
can make a change of parameter to s = Xt where now f{s) is defined for 
| ^ | < 1. Using this let 

a=/( l)eV and X= log a eg. 

From the preceding paragraph exp \X is the unique square root of a e V. 
Thus since/(i) is also a square root of a = /(l) we obtain by uniqueness 
fd) = exp \X. Applying this argument to fd) and taking its unique square 
root we obtain /(£) = exp \X and therefore log/(i) = \X. Using induction 
one obtains /(1/2") = exp(l/2 n T) and for any integer p we have since/is a 
homomorphism 

/0>/2") =/(l/2") p = exp((l/2")Z) p = e\p((p/2 n )X). 

Thus for every dyadic rational number q we have f(q) = exp qX and by 
continuity /(f) = exp tX for all t e R. 

Analogous to Corollary 3.15 we have the following result. 

Exercise (4) Let /: G-* H be an (algebraic) homomorphism of the Lie 
groups G and H. If/ is analytic at e e G, then/is analytic on all of G. 

Theorem 5.22 Let G, H be Lie groups and let /: G -> H be a continuous 
homomorphism of the corresponding topological groups. Then/is an analytic 
homomorphism of the Lie groups. 
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Proof Let g be the Lie algebra of G and let g = g x + • • ■ + g m be a sub¬ 
space direct sum of the one-dimensional spaces g t = RX, where X lt ..., X m 
is some basis of g. Let exp ; = exp | ^ and let 

( p:g^G: A t -\ -+ A m -»exp A x ■ • • exp A m 

be the map which gives the canonical coordinates of the second kind where 
A i = t i X i eg i [see remark (4), Section 5.3]. Then, since 

f° <K A i + "' + A m) =/(exP j A t ■•■exp m A m ) =/(exp 1 A 1 )---/(exp„AJ, 

we have an expression for fo $ : g -► H in terms of analytic functions, 
applying Lemma 5.21 to each /° exp* for i = 1, ..., m. Thus/o $ : g-+H is 
analytic. However <p is a local analytic diffeomorphism of a neighborhood 
U 0 of 0 in g into a neighborhood U e of e in G. Therefore, since (j)" 1 : U e -> U 0 
is analytic, /= (fo <£) ° </> _1 i s analytic on U e ; that is, / is analytic at e e G, 
and by the preceding exercise/is analytic on all of G . 

Corollary 5.23 Two real Lie groups which are isomorphic as topological 
groups are actually isomorphic as Lie groups. 

Remark (2) Let G be a Lie group with analytic structures and stf 2 
which give topologies S’ x and S ’ 2 . Let 9 x denote the Lie groups (G, and 
let Gi denote the topological groups (G, S’ i ). Then G x £ G 2 as topological 
groups implies 9 X s ^ 2 as Lie groups; that is, the continuous isomorphism 
which expresses G 1 ~ G 2 is actually analytic and therefore an analytic iso¬ 
morphism which expresses <$ x ^<0 2 . Thus the analytic structures are uniquely 
determined. In particular, if we say G x = G 2 via the continuous identity 
isomorphism, then 9 X = ^ 2 as Lie groups. 



CHAPTER 6 


LIE SUBGROUPS AND SUBALGEBRAS 


In this chapter we consider Lie subalgebras of a Lie algebra and show their 
basic relationships with Lie subgroups. Thus each Lie subgroup yields a Lie 
subalgebra and conversely each Lie subalgebra is the Lie algebra of a Lie sub¬ 
group. Next we show that two Lie groups are locally isomorphic if and only 
if their Lie algebras are isomorphic; in Chapter 8 we extend this to a global 
result. In the third section we prove the very useful result that an abstract 
subgroup of a Lie group, which is a closed subset, is actually a Lie subgroup. 
Next we discuss homogeneous spaces GjH where H is a closed subgroup of 
the Lie group G and show how to coordinatize GjH using the exponential 
mapping, Then we apply these results to quotient groups. Finally we show 
that a commutative connected Lie group is isomorphic to R q x T p where R q 
is a ^-dimensional Euclidean space and T p is a p-dimensional torus. 


1. Lie Subalgebra and Uniqueness of Analytic Structure 


From the various characterizations of a Lie subgroup H of a Lie group G, 
we see that the Lie subgroup H cz G must satisfy: 

(1) H is a Lie group; 

(2) the injection/: H-* G : x -»x is an analytic immersion. 

Thus (Tf)(b ): T(H, b) -► T(G, b ) is injective for every be H. We use Tf(e) to 
identify the Lie algebra of H with a subalgebra of the Lie algebra of G and 
show how a Lie subalgebra can be used to generate a Lie subgroup of G. 
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Definition 6.1 Let g be an (abstract) Lie algebra; that is, g has an 
anticommutative bilinear multiplication [AT] such that the Jacobi identity 
holds (Section 1.6). Then a subvector space hag is a Lie subalgebra if for all 
X , Yeh we have [XY] e h. 

Proposition 6.2 Let H be a Lie subgroup of the Lie group G given by 
f:H-+G as above. Then Tf(e ) : T(H, e) -> T(G, e) is an injective homomor¬ 
phism. Thus we can consider the Lie algebra of H as a Lie subalgebra of the 
Lie algebra of G. 

Proof We just note that the identity map/ :i/->G:;e->;cisa homo¬ 
morphism and therefore Tf(e ) is a Lie algebra homomorphism. Also, by 
definition of immersion. Tf(e ) is injective. 

Corollary 6.3 Let H be a Lie subgroup of a Lie group G and let h , g be 
the corresponding Lie algebras and regard hczg. Then exp w = exp G | h. 

Proof Let f :H^>G:x-+x, Then from Proposition 5.19 we have for 
X 6 T(H, e), 

exp,, X =/(exp H X) = exp a (Tf(e)X). 

Thus after the above identifications, we obtain the result. 

Proposition 6.4 Let G be a Lie group with Lie algebra g and let H be a 
Lie subgroup of G with Lie algebra h. Then h = {X e g : all t e R, exp tXe H 
and R->G : t exp t X is continuous}. 

Proof For Xeh, we have from Lemma 5.21 that a :R-+H:t^> 
exp H tX = exp G tX is analytic which gives one inclusion. Conversely, suppose 
for Xe G that the map a : R-+ H \ t — ► exp G tX is continuous. Then from 
Lemma 4.9 (with/= exp G , M = R, N = G, and P = H), we see that a is ana¬ 
lytic. Thus sincea(f) e H witha(0) = ee H, we obtain X = d(0) e T(H , e) = h. 

Exercise (1) Let H 1 and H 2 be connected Lie subgroups of a Lie group 
G, and let h t and h 2 be the corresponding Lie subalgebras. If h x —h 2 , then 
H x = H 2 as Lie groups. 

From the above results or from Corollary 5.23, we have the following: 

Corollary 6.5 If H 1 and H 2 are two Lie subgroups of the Lie group G 
such that H t and H 2 are equal as topological groups, then they are equal as 
Lie groups. 
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Theorem 6.6 Let G be a Lie group with Lie algebra g . If H is a Lie sub¬ 
group of G with Lie algebra h, then h is a Lie subalgebra of g. Conversely, 
for each Lie subalgebra h of g , there exists a unique connected Lie subgroup 
H of G which has h as its Lie algebra. This Lie subgroup H is the smallest 
Lie subgroup of G containing exp h. 

Proof From the previous results, it suffices to prove the converse. First, 
the uniqueness follows from the above exercise. We now use the Campbell- 
Hausdorff theorem to obtain a local Lie group B of G as follows. Let m be a 
subspace of g such that g = m + h (subspace direct sum). Then using can¬ 
onical coordinates of the second kind [Section 5.3, remark (4)], there exist 
open, connected, symmetric neighborhoods V (respectively W) of 0 in h 
(respectively m) and a neighborhood U of e in G such that the map 

(j ): W + V-> U : X + exp X ■ exp Y 

is an analytic diffeomorphism. Now let B = exp V where V <= h. Then since 
h is a subalgebra, we have for all Y and Z in a suitable neighborhood of 0 in 
h, F( 7, Z) = log(exp Y exp Z)eh which uses the Campbell-Hausdorff 
theorem. Thus using Theorem 5.16 we see that B is a local Lie group relative 
to the operations in G (brief exercise). Note that the topology on B is given 
by prescribing as a family of neighborhoods of ee B the system of sets 
exp S where S ranges over the neighborhoods of 0 in V. 

Next, using the canonical coordinates of the second kind as given above, 
we see ( U , </> -i ) is a chart at e in G. Also defining y : B V : exp Y -> Y for 
Y e h, we obtain a chart (B , y ) of e in B such that <j> “ 1 1 B = y and the Jacobian 
matrix {d{4>~ l )ijdyj{e)) has rank equal to dim B (= dim h). Thus by Corollary 
4.13, B generates a connected Lie subgroup H of G. 

Let <£(H) denote the Lie algebra of H. Then from Proposition 6,4 we see 
h c However, since dim h = dim B = dim H = dim £?(H), we have 

equality. 

Remark (1) We can use Theorem 6.6 to show that an (abstract) finite¬ 
dimensional Lie algebra L over R occurs (up to an isomorphism) as the Lie 
algebra of some Lie group. First we state Ado’s theorem (see Jacobson’s proof 
[1962]). 

Ado’s Theorem Let L be a finite-dimensional Lie algebra over a field K . 
Then there exists some finite-dimensional vector space V over K such that L 
is isomorphic to a Lie subalgebra h of gl(V). 

Now gl(V) is the Lie algebra of GL(V), so that the connected subgroup 
H of GL(V) generated by exp h is a Lie group with Lie algebra h isomorphic 
to L. Note that this does not give much information on L but just says that 
gl{V) and GL(V) are rather complicated. 
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2. Local Isomorphisms 


We now determine what type of information about a Lie group can be 
obtained from its Lie algebra—roughly those properties preserved by a Lie 
algebra isomorphism. 

Proposition 6.7 Let /: G -*■ H be a local homomorphism of the Lie 
groups G and H. Let g and h be the corresponding Lie algebras and assume 
the Lie algebra homomorphism (Tf)(e) :g->h is bijective. Then/is a local 
isomorphism. 

Proof Since Tf(e) is bijective, we have by the inverse function theorem 
neighborhoods V of e in G and V of e in H such that/: t/-» Fis an analytic 
diffeomorphism. However, since / is a local homomorphism, we have on 
suitable neighborhoods that the analytic inverse of/is also a local homo¬ 
morphism; that is,/is a local isomorphism. 

Theorem 6.8 Let G and H be Lie groups with Lie algebras g and h. 

( a ) If/: 9 -»h is a Lie algebra homomorphism, then there exists a local 
Lie group homomorphism <f> :G^>H such that (T(p')(e) =/ 

(b) G and H are locally isomorphic as Lie groups if and only if g and h 
are isomorphic as Lie algebras. 


Proof Part (b) follows from (a), Proposition 6.7, and the obvious 
modification of Proposition 5.19. For (a) we follow the proof of Chevalley 
[1946, p. 112] and note that k = {{X,f{X))\ Xeg} is a Lie subalgebra of 
g x h as in exercise (2) in Section 5.1. Let K be the connected subgroup of 
G x H with Lie algebra k (Theorem 6.6), and let n : G x H -► G : (x, y) -► x 
be the analytic projection map. Now the map r = n | K : K-+ G is a Lie group 
homomorphism and ’ 

(Tr)(e, e):k^g:(X,f(X))->X 

is actually a Lie algebra isomorphism since X = 0 implies (X,f(X)) = (0, 0) 
and the map is clearly surjective. Thus by Proposition 6.7, r is a local iso¬ 
morphism with local inverse s : V W where V c G and W c K are suitable 
neighborhoods of the corresponding identity elements. Also we have 
(Ts)(e): g->k and 

Ts(e)X = (Ts)(e)[(Tr)(e, e)(X, f(X))} 

= T(s o r)(e, e)(X,f(X)) = (X,f(X)). 
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Now let p : G x H^> H be the natural analytic projection. Then p is a 
Lie group homomorphism. Thus if we let 


<j) = p o s : G -> H, 


we obtain the desired local homomorphism because 

( T<t>)(e)X=T(pos)(e)X 

= ( Tp)(e , e)(X 9 f(X)) =f(X). 

Remarks (1) As previously stated, we shall use simple connectivity to 
obtain the following result: Let G be a connected and simply connected Lie 
group, and let/: G-+Hbea local homomorphism of G into a Lie group H. 
Then there exists a unique extension of / to a (global) homomorphism of G 
into H. 

(2) The proof of Theorem 6.8 uses indirectly the Campbell-Hausdorff 
theorem, and we now sketch a more direct approach. Using the notation of 
Section 5.3, let U 0 be a neighborhood of 0 e g and U e a neighborhood of e e G 
such that we have the composition for X> Ye U 0 , 

exp X exp Y = exp F(X , 7), 

where F(X, Y) = X + Y + i[XY] + - - - + 1/A:! F k (6)(X, Y) w + • • • is given 
by the Campbell-Hausdorff theorem. From this theorem each term 
F k (6)(X, Y) w is a Lie polynomial in X and Y. Therefore, for the homo¬ 
morphism/: g^hwe have f[F k (0)(X, Y) w ] = F k {0){f{X),f(Y)) ik \ using the 
definition of a Lie polynomial. Because /is linear (and continuous) this gives 

f[F(X, Y)] = F(f(X),f(Y)). 

We can assume that U 0 is small enough in g so similar results hold in some 
neighborhood of 0 eh which contains f(U 0 ). Now for x = exp X e U e , define 
4> by 

4>(x) = exp H f(X) 

and note that for x = exp X, y = exp YsU e 

(j>(xy) = $(exp F(X, Y)) 

= exp H f[F(X, Y)] 

= exp H F(f(X)J(Y)) 

= exp H f(X) ■ exp h /(Y) = <p(x) ■ 4>(y). 


Also we see T<p(e)X = f{X). 
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3. Closed Subgroups 


We have previously given criterion for a subgroup H of a Lie group G to 
be a Lie subgroup, and we now give an extremely useful criterion concerning 
a closed subgroup. 

Theorem 6.9 Let G be a Lie group, and let H be an (abstract) subgroup 
of G which is a closed subset of the topological space G. Then there exists a 
unique analytic structure on H such that H is a Lie subgroup of G and the 
topology on H induced by this analytic structure is the topology induced by 
G ; that is, H is a topological subgroup of G. Furthermore the Lie algebra of 
H equals {X e g : exp G tXeH for all t e R}. 

Proof First note that the uniqueness of this analytic structure follows 
from Corollary 6.5. From the definition of a Lie subgroup, it suffices to show 
H is an analytic submanifold of G. We shall use the various group properties 
and closure, and separate the proof into several parts. 

(a) Let h = {X e g : exp tX e H for all t e R}. Then h is a Lie subalgebra 
of g. For X eh implies sX eh for anyiefi. Next, for X, Y eh, we have from 
the formulas (/) and (g) of remark (1) following Theorem 5.16 that for any 
t e R, exp t(X + Y ) 6 H since H is closed. Thus X + Y e h and similarly 
[XY] e h so that h is a Lie subalgebra of g. 

(b) Let m be a subspace of g such that g = m + h (subspace direct sum). 
Then there is a neighborhood D of 0 in m such that 0 # Xe D implies 
exp X $ H. Suppose this is false. Then there is a sequence { Y t } cz m such that 
lim Y t = 0 and exp 7 ; e H. Now we can regard m as a Euclidean space with 
norm || || and let k = {Xe m : 1 < ||7|| < 2}. Then we can choose integers 
n t such that X t = n t Y t ek. Thus since k is compact, we can assume that 
O^X lim Xj exists in k c: m (passing to a subsequence if necessary). 
Furthermore, since lim Y t = 0, we see that lim l/n, = 0 and exp(l /n t X) = 
exp YieH. 

We shall now show that the above X is in h and obtain the contradic¬ 
tion Xemnh = { 0). Thus let t t = 1/n, so that lim t, = 0. Then since 
ex p(-1; Xf) = (exp I, X t ) 1 e H, we can assume f ; > 0. Similar reasoning 
shows that X e h if exp t X e H for all t > 0. Now for each t > 0, let 

k/(t) = largest integer < tjt x . 

Then since (f/f,) - 1 < k x (t) < t/t x and lim t t = 0, we see 

lim j = t. 
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Since exp t { X t e H, we see exp k ( (t)*i X t = [exp t,Xi] k,w e H. However, since 
exp is continuous and lim X = tX, we obtain 

exp tX = lim exp fCi(t)t t X t e H. 

Thus by definition of h in (a) we have the contradiction X e h. 

(c) For some neighborhood U in G with e e U, we have 

U r\ H = U r, exp h = exp F 

for some neighborhood F of 0 in h. For using the coordinates of the second 
kind relative to g = h + m, we can find neighborhoods F (respectively V') of 
0 in h (respectively m) and U of e in G such that for W = F + F' the map 

<j) : W-* U \ X + X' -> exp X exp X' 

is a diffeomorphism. Now for x e U = <j>( W), we have x = exp Xexp X' with 
XeVczh and X’ eV' <= m. Thus if xe 17 n H, then x = exp Zexp X'e H 
implies exp X' e H since H is a group. Thus from (b) (shrinking V to D if 
necessary) we see that X' = 0; that is, U n H = U n exp h = exp V. 

(d) The set H is an analytic submanifold of G. To show that H is the 
underlying set of a submanifold, we use the converse of Proposition 2.11. 
Thus for pe H, we must show that there is a neighborhood U in G with peU 
such that UnH is a submanifold of U; that is, there is a coordinate function 
z so that ( U , z) is a coordinate system of G at p which satisfies (a) and (b) of 
Proposition 2.11. However, we can multiply by p 1 and translate the situa¬ 
tion to e e G. But these results follow from (c) above when we take z x ,.. •, 
z m ,..., z„ to be canonical coordinates on U as given in (c), and we take the 
neighborhood W (in Proposition 2.11) to be exp V= U n H. Also because 
U is open in G, this formula yields the topology, induced by the analytic struc¬ 
ture, equals the topology induced on H by G. With this induced topology, we 
see that the map R-* H\t~* exp tX for X eh is continuous, and therefore 
from Proposition 6.4, h equals the Lie algebra of H. 

Remark (1) Let H be a closed Lie subgroup with Lie algebra h as given 
above, and let e > 0 be given. Then 

h = {Xe g \ exp sX e H for all s e R with | s\ < e}. 

For any given t e R, there exists m so that t = £”= ^, where | s t | < e and 
we have exp tX = exp X • • • exp s m X € H. 

Definition 6.10 Let A denote a finite-dimensional nonassociative algebra 
over R. Thus A is a finite-dimensional vector space with a bilinear map 
a : A x A -> A. Let B be another finite-dimensional (nonassociative) algebra 
with multiplication function /?. 
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(a) A linear transformation F : A —► B is an algebra homomorphism if 
F(a(Z, Y)) = p(F(X), F(Y )) for all X, Ye A. 

(b) The kernel of an algebra homomorphism F: A -* B is the set ker F = 
{X e A : F(X) = 0}. 

(c) A subspace D of an algebra A is an ideal if for all Xe A, 

<x(X, D) = {<x(X, Z): Z e D} <= D 

and 


<x(D, X) = {oc(Z, X):ZeD}<=D. 

Exercise (1) Analogous to the case for associative algebras show: D is 
an ideal of A if and only if D is the kernel of some algebra homomorphism 
F: A->B. 

Proposition 6.11 Let G and H be Lie groups with Lie algebras g and h 
and let/: G-»H be a Lie group homomorphism. 

00 Ker/ is a closed normal Lie subgroup of G and the Lie algebra of 
Ker/ equals ker(7/(<?)) which is an ideal in g where Tf(e): g -> h is the cor¬ 
responding Lie algebra homomorphism. 

(b) If G is connected, then Im(/), the image of / is a Lie subgroup of H 
and the Lie algebra of Im(/) equals lm(//{/)). 

Proof (a) Let e be the identity element in H. Then Ker(/) = {xeG: 
f(x) = e} =f~\e) is a normal subgroup which is closed in G. Thus by Theorem 
6.9, Ker(/) is a Lie subgroup and its Lie algebra 

^f(Ker/) = {Xeg : /(exp tX) = e for all t e R}. 

Thus X e ^(Ker/) if and only if e =/(exp G tX) = exp H (tTf(e)X) for all t e R. 
This implies Tf(e)X = 0 using exp as injective for t near enough 0 e R ; that 
is> ^f(Ker/) c ker(7/(e)). From the same formulas, the other inclusion is 
clear. Also by the preceding exercise, ker(7/(e)) is an ideal. 

For (b) note that since Tf(e) is a Lie algebra homomorphism, k = \m(Tf(e)) 
is a Lie subalgebra of h. Thus let K be the connected Lie subgroup of H with 
Lie algebra k. Then K is generated by elements exp(7/(e)A) with Xeg. 
However, Im(/) =/(G) is generated by elements /(exp X) for Xeg, since G 
is connected. Therefore since/(exp X) = exp(Tf(e)X), we have K = f(G) since 
both of these subgroups are connected. 

We shall consider the corresponding quotient groups in the next section 
on homogeneous spaces. 
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Exercises (2) Let G and H be connected Lie groups and let/: G-* H 
be a Lie group homomorphism and let Tf(e) : g -* h be the corresponding 
Lie algebra homomorphism. Then show: 

(i) / is surjective if and only if Tf(e) is surjective; 

(ii) / is injective implies Tf(e) is injective. What about the converse ? 

(3) Let G be a Lie group and H an open subset which is a subgroup of 
G. Then show H is a Lie subgroup of G such that G 0 = H 0 (identity com¬ 
ponents). 

Remark (2) Using the results of this chapter, we briefly review the 
subgroups SL(n), SO(n), etc. of GL(n, R) discussed in Section 2.3. Thus let 
R* be the multiplicative (Lie) group of the nonzero real numbers. Then 
det: GL(n, R) -* R* is an analytic homomorphism of Lie groups. From this 
SL(n) = Ker det = {A e GL(n, R) : det A = 1} is a closed subgroup of 
GL(n, R) and consequently a Lie subgroup (see Proposition 6.11). From 
det(exp A) = e ,rA we see det ° exp = exp ° tr and using the chain rule we 
obtain T(det)(7) = tr. Thus from Proposition 6.11 the Lie algebra of Ker det 
equals ker T(det)(/) = ker tr = {A e gl(n, R): tr A = 0} = sl(n). 

Let V be an //-dimensional vector space over R and let B: V x V-+ R be 
a nondegenerate bilinear form on V. For G = GL(V) let 

H = {A e G : B(AX, A Y) = B(X, 7) for all X, Ye V). 

Then H is a closed subgroup of G. Thus by Theorem 6.9, H is a Lie subgroup 
of G. Let h be the Lie algebra of H. Then, from Theorem 6.9, Aeh \f and 
only if B((exp tA)X, (exp tA)Y) = B(X, 7) for all t e R and X, Ye V. Thus 
as before we obtain 

h-{Aegl(V) : B(AX, 7) + B(X, AY) = 0 for all X, Ye V}. 

In particular, when B is positive definite and symmetric, we obtain the 
orthogonal group H = 0(ri). Next since /= det: O(n) —> R* is an analytic 
homomorphism we have the special orthogonal group SO(n) = Ker/ Thus 
if superscript ‘ denotes the transpose, the Lie algebra of SO(n) equals 
ker 7/(7) = [A e gl(n) : A' = -A} = so(n). As before dim SO(n) = n(n - l)/2 
and since SO(n) is connected we have Theorem 6.6 that SO(n) is the unique 
connected Lie group generated by exp so(n). Recall from exercise (5). Section 
3.2 that 0(n) is compact. Thus SO(n) is compact since it is closed in O(n). 

Similarly the other groups SO(p, q ) and Sp(p, R) are Lie groups with Lie 
algebras so(p,q) and sp(p, R) as discussed in Section 2.3. Are these groups 
compact? 
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4. Homogeneous Spaces 

In this section we consider the coset space G/H and make it into an ana¬ 
lytic manifold so that the projection it : G —* G/H is analytic and the action of 
G on G/H given by G x G/H-> G/H: (a, xH) -> axH is also analytic. From 
a Lie algebra decomposition g = m + h we use m to coordinatize G/H and 
give examples. Then we apply these results to quotient groups. 

Thus let H be a closed subgroup of G so that H is a topological Lie sub¬ 
group according to Theorem 6.9. Consequently from Theorem 3.11, G/H is a 
Hausdorff space relative to the quotient topology and the natural projection 
n:G -*■ G/H is open and continuous. Now let g and h be the Lie algebras of 
G and H, and let m be a subspace of g so that g = m + h (direct sum). Then 
we use the canonical coordinates of the second kind to prove the following 
result [Helgason, 1962, p, 113]. 

Lemma 6.12 There exists a compact nucleus D of 0 in m and there exists 
a compact set N containing a neighborhood of e = eH in G/H such that 
exp | m : D -> exp(D) is a homeomorphism, and n : exp(D) -> N is a homeo- 
morphism. Thus n ° exp|/w : D — * N is a homeomorphism. 

Proof As in the proof of Theorem 6.9 we use canonical coordinates of 
the second kind. Thus relative to g = m + h (direct sum) we can find neigh¬ 
borhoods W (respectively W') of 0 in m (respectively h) and a neighborhood 
U of e in G such that 

<p(W + W') n H = U n H = exp W', 

where U = (j>(W + W') is a coordinate neighborhood of e in G and </> is given 
by </>{X + X') = exp X exp X'. Now let ^ = exp| m and let D <= W be a 
compact nucleus of 0 in m such that exp(- D) exp D <= U. Then the restriction 
•A : D -> i //(D) is a homeomorphism. 

Set N = 7r(i KD)). We now show n : ip(D) -> TV is injective. For let X, Ye D 
and assume 7r(exp X) = ji(exp Y). Then (exp X)H = (exp Y)H so that 

exp(- X) exp YeH n U = exp W'. 

Therefore there is Z e W' so that 

exp X exp Z = exp Y exp 0. 

However, since we are using coordinates of the second kind we have that 
</>\W+ W' is injective. Thus Z = 0 which implies n : ijj(D) -»■ N is injective. 
Thus n\i//(D) is a continuous injective map of the compact set <{/(D) onto 
TV and consequently a homeomorphism. 
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Note N is compact but it also contains an open neighborhood of e as 
follows. Since D contains an open neighborhood of 0 in m, D + W' contains 
a neighborhood of (0, 0) in W + W f . Therefore exp D exp W' contains an 
open neighborhood of e in G and since n : G —► GjH is an open map, 
n(\j/(D)) = 7 r(exp D exp W') contains an open neighborhood of e in GjH. 

Remarks (1) Let N° be the interior of the set N = n{\jj(D )), let D° be 
the interior of D , and let k = n\\p(D). Then for a fixed basis X x ,..., X r of 
m the map 

i)/' 1 ok _1 : N° -> D° : 7r(exp £ *i X i) Z X t x t 

is a homeomorphism which defines a chart ( N° , x) at e e G/H where x = 
x//' 1 o k' 1 . Consequently we can use this to define an analytic structure (on 
N°). 

(2) Using the coordinates of the second kind as above we see the map 
g -► G : X + X' -+ exp X exp X ' 

is an analytic diffeomorphism of a suitable neighborhood of 0 onto 
E = exp D° exp W'. Thus the map a : E m : exp X exp X ' ^ X is analytic. 
However, on E we see n = x' 1 °(ris analytic where x is the coordinate map 
above. 

Next, using N° = n(E) with E as above, we have for p = exp X exp X' e E 
that x : N° -+ m : pH -► X. Thus the map r = exp ° x : N° -► G is analytic. 
Furthermore (n ° r)(pH) = (n o exp o x)(pH) = 7r(exp X) — pH; that is, 
n o r = identity | N°. Combining these results we obtain that there exist neigh¬ 
borhoods E of e in G and N° of e in GjH so that: 

(i) 7 i: £ N° is analytic; 

(ii) there is an analytic map r : N° -> G so that n ° r = identity | N°. 

We apply these results to show the following. 

Theorem 6.13 Let G be a Lie group, let H be a closed (Lie) subgroup, 
and let G/H have the quotient topology as before. 

(a) Then G/H has a unique analytic structure such that: 

(i) the projection map n : G -> G/H is analytic; 

(ii) every p e GjH has a neighborhood P and an analytic map r : PG 
so that n o r = identity | P. 

(b) With the analytic structure in (a) on G/H we have for every aeG that 
the map t (a) : G/H GjH : xH^axH is an analytic diffeomorphism. 
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(c) With the analytic structure in (a) on G/H the map <j>: G x G/H -> 
G/H : (a, xH) —> axH is analytic. 

Proof First we assume (a) holds and prove (b). Since t (a) has inverse 
r(a -1 ) it suffices to show x(a) is analytic. Then is also analytic. We now 
note for x e G that 

x(a) o n(x) = axH = L(a)(x) 

so that on G we have x(a) °n = n° L{a). Therefore for x = xH e P as above, 
we see r(a)(x) = x(a) ° (identity)(x) = x(a) ° n <= r(x) = n ° L(a) ° r(x). Thus on 
P we see x(a) = n ° L(a) o r is a composition of analytic maps and therefore 
is analytic. 

This implies that it is sufficient to prove the existence and uniqueness of 
the analytic structure at the point e = eH in G/H, for if a coordinate system 
exists on a neighborhood N° of e, then we obtain a coordinate system at any 
other point p = pH by translation via the analytic map x(p). Futhermore if 

(1) and (ii) hold for N°, then they hold for x(p)N° = P. Briefly, if r satisfies (ii) 
for JV°, then r = L(p) ° r ° x (p -1 ) satisfies (ii) for P using x(p) o n = n° L(p). 
The proof is similar for (i). 

Now from the preceding remarks we have shown the existence of an 
analytic structure at e satisfying (i) and (ii) of (a) and we prove uniqueness. 
Suppose N° has two analytic structures with two coordinate maps x x and x 2 
and maps and r 2 from E into G satisfying (a) where E is given in remark (2). 
We shall show x x ° xj 1 is analytic. Thus x t => xj 1 = x 1 ° n ° r 2 <= xrj 1 . How¬ 
ever from (i), Xj ° n is an analytic map of G into some Euclidean space and 
from (ii), r 2 ° x 2 _1 is analytic. Similarly x 2 ° x^ 1 is analytic so that from 
Section 2.2 we see that the analytic structures are equivalent. 

To show $ : G x G/H - G/H : (a, xH) - axH is analytic, it suffices to 
show that its restriction to G x N° is analytic. Thus for r as defined in remark 

(2) above and for p : G x G -> G the multiplication function on G, we see 
(j)\G x N° can be factored as follows. 

G x N° GxG G —► G/H, 

and since these are analytic, so is $ = n ° p ° (i G x r). 

Remarks (3) Let G be a Lie group and let M be an analytic manifold 
such that G is a topological transformation group on M (Section 3.2). Then 
G is a Lie transformation group of M if the map G x M-> M : (a, x)-> ax is 
analytic. Thus in the above theorem G is a Lie transformation group of G/H. 
Furthermore, it is proved by Helgason [1962, p. 113] and Tondeur [1965, 
p. 155] that the above analytic structure on G/H is the unique analytic structure 
such that G (with the above action) is a Lie transformation group on G/H. 
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(4) For peG , the coordinates at p = pH as discussed in the above proof 
are explicitly given as follows. Let ( N° > x ) be the coordinates at e given in 
remark (1) where 

x : N° -► D° : rc(exp £ x t X f ) -► £ . 

Then for the neighborhood P = z(p)N° we have that 

y : P-* D° : n(p exp £ x t X t ) -+'Zx i X i 
is a coordinate map where we regard m — R”. 

Exercise (1) Let H be a closed subgroup of the Lie group G and let 
g = /h + h be a fixed decomposition of the Lie algebras as above. Show that 
the map n : G -► G/// yields the epimorphism Tn(e) : T(G, e) -► T(G/H , e). 
Thus identifying g = T(G , e) show that ker T?r(e) = h and consequently 
T(G/i7, e) = glh S m. 

Example (1) Let G = SG(/i) and for/? + ^ = n let i/ = SGCp) which can 
be considered as a subgroup G by regarding i/ as matrices of the form 



where A is &p x p orthogonal matrix with det A = 1 and I the q x q identity 
matrix. The Lie algebra of G is g = so(n ) which is the set n x n skew-symmetric 
matrices. Thus the Lie algebra h = so(p) is given by the set of matrices 

[° ° 1 . 

|0 b 22 \ 

where B 22 is a p x p skew-symmetric matrix and m is given by the obvious 
complementary set of matrices in g , namely 



Thus for X given by (*) in a suitable neighborhood of 0 in m, the coordinates 
at e e G/H as discussed above are given by 

(exp X)H -► X. 

Exercise (2) Let S n ~ 1 be represented by SO(n)ISO(n - 1) as in example 

(4) , Section 3.2. Compare the coordinates near the point / e SO(n)/SO(n - 1) 
as given above with the stereographic projection coordinates given in example 

(5) , Section 2.1. 
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Definition 6.14 Let A be a nonassociative algebra with multiplication 
function a and let D be an ideal of A . Then the coset vector space A = 
A/D —{X + D : x e A} is made into an algebra as follows. For X = + Z> 

and F — 7 + Z) in A, define oc: A x A ^ A by 

oc(X , F) - a(JT, y) + D = a(JT, y). 

The algebra A is called a quotient algebra. 

As in the associative case, it is easy to verify that a is a function which is 
bilinear so that A with multiplication a becomes an algebra. Furthermore the 
map A -► A : X-> X is an algebra homomorphism. 

Proposition 6.15 Let G be a Lie group with Lie algebra g and let H be a 
closed normal (Lie) subgroup with Lie algebra h. Then the factor group G/H 
with the analytic structure given in Theorem 6.13 is a Lie group and its Lie 
algebra is isomorphic to the quotient algebra g/h where h is an ideal of g. 

Proof The factor group G/H is a topological group relative to the 
quotient topology and is an analytic manifold relative to the analytic structure 
of Theorem 6.13. Thus we must show that the operation 

G/H x G/H -► G/H : (xH, yH) xy~ X H 

is analytic. However, by remark (2), Section 4.1 and the uniqueness of the 
analytic structure of a Lie group, it suffices to show the multiplication 

\x : G/H x G/H -> G/H : (jc H, yH) -► xyH 

is analytic at (e, e). Thus let N° be the neighborhood of e in remark (2) with 
n o r = identity | N° where r : N° -► G is analytic. Then for x, y e N° we use 
7i G —> G/H as a group homomorphism to obtain 

/J(x, JO = p(n o r(x), n ° r(y )) = nn(r(x), r(y)), 

where g is the multiplication in G. Thus fi = nofio(rxr) which is analytic 
at ( e , e ). 

Now n : G -> G/H is an analytic homomorphism of Lie groups and if 
&(G/H) is the Lie algebra of G/H , then Tn(e) : g -► &(G/H) is a Lie algebra 
homomorphism. Thus since Ker n = //, we have by Proposition 6.11 and the 
characterization of h in Theorem 6.9 that ker(T 7 i(e)) = h is an ideal. Thus 
££{G/H) is isomorphic to g/h , 

Corollary 6.16 Let /: G H be a Lie group homomorphism and ass¬ 
ume that G is connected. Let/: G/Ker/-►/(G) be the induced isomorphism 
as abstract groups. Then / is an isomorphism of Lie groups where the Lie 
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group structure of /(G) (respectively G/Ker/) is given by Proposition 6.11 
(respectively Theorem 6.13). 

Exercise (3) Let /: G -► H be a surjective analytic homomorphism of 
the Lie groups G and H. Let H' be a closed normal subgroup of H and let 
G' =/ -1 (//'). Then show that the isomorphism G/G' ^ H/H' of abstract 
groups is actually a Lie group isomorphism. 


5. Commutative Lie Groups 


We shall apply some of the preceding results on homomorphisms to show 
that a connected commutative (i.e., Abelian) Lie group G is isomorphic to 
R q x T p for suitable integers p and q. This also shows that a connected Lie 
group G is commutative if and only if its Lie algebra g is commutative. 

Recall (Section 3.1) that a topological subgroup H of a topological group 
G is a discrete subgroup if H is a discrete subspace of G; that is, every subset 
of H is open in H. If G is a Lie group, then a discrete subgroup H can be re¬ 
garded as a zero-dimensional Lie group and in this case H is a closed sub¬ 
group of G. 

Proposition 6.17 The set H is a discrete subgroup of the additive Lie 
group R n if and only if H is isomorphic to Z p = Z x * • • x Z (p-times) for 
some p with 0 < p < n. 

Proof It is clear that Z p is a discrete subgroup of R n . Conversely, we 
shall now show there are linearly independent elements u x ,..., u p of R n which 
generate the discrete group H as a Z-submodule of R n . Thus H = Zu t + • • ■ + 
Zu p which gives the result. 

Lemma 6.18 Let H be a nonzero discrete subgroup of R n . Then H is 
generated as a group by elements which are linearly independent in R n . 

Proof We show this by induction on the dimension n. Thus for n = 0, 
the result follows. Now for n > 1, assume the lemma is true for n — 1 and let 
H be a nonzero discrete subgroup of R n . Let {u t , ..., u m } be a maximal set of 
linearly independent elements (over R) of H. First assume m <n. Then for 
n — 1 this set is empty. Thus for n> 1, we see that H is contained in a proper 

subspace; that is, H c= Ru x +-h Ru m = R m for m<«. We can now apply 

the induction hypothesis to obtain the result. 
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Next assume m = n and let V be the subspace of R n generated by {tq ,..., 
u n -i}- Then H n V is a discrete subgroup of V and so by the induction hy¬ 
pothesis we can assume H n V is generated by {w A ,..., w n _ x } as a Z-module. 

Let Q = { Yj = ibi u i : 0 < b % < 1 for i = 1,..«} be the compact cube 
in R n . Since H is discrete, Q contains only finitely many elements of H. Let 

P = jl bj u t eQ n H:b n > o|, 

Then u n e P and let u be an element in P with b n minimal (since there are only 
finitely many elements in P). We shall now show that the basis {u x ,..., 
of R n generates H as a group; that is, H = Zu x + * * * + + Zu. 

Thus let h = \hiiii 4- h n u e H for h ( e R with / = 1,..., n. Now we shall 
show h n e Z. Then since h — h n u e H n V, we have by the results of the pre¬ 
ceding paragraph that e Z. Let w — e P (with b n minimal) 

and let k n be the largest integer smaller than h n and let k t be the largest integer 
smaller than h t + bi(h n — k n ) = c t for i = 11. The following element 
in H 

n — 1 n ~ 1 

h ~ Z M; - = Z (*i ~ *i)«i + - k „)u 

i= 1 i = 1 

= Z - K) - + (*„ - k„) Z 

i= 1 i= 1 

n — 1 

= Z (°i - k i) u i + (h„ - k n )b n u n 

i= 1 

is in Q because by definition 

0 < c t — k x < 1 

and by definition 0 < h n ~ k n < 1 yields 

0 <b n {h n - k n ) <b n < 1. 

Therefore this element is in P. However, this last equation also contradicts 
the definition of b n as minimal in the choice of u e P if we assume h n is not an 
integer. Thus we must have h n e Z. 

Proposition 6.19 Let G be a commutative Lie group with Lie algebra g. 
Then, regarding g as an additive Lie group R m , the map exp : g -» G is a homo¬ 
morphism of Lie groups and Ker exp is a discrete subgroup of g. 

Proof From Section 5.3 we have seen exp : g -► G is a homomorphism. 
Now there exist neighborhoods U 0 of 0 in g and U e of e in G such that 
exp | U 0 : U 0 -» U e is a diffeomorphism. Therefore, since exp is injective on U 0 , 

(Ker exp) n U 0 = (0}. 
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Thus since U 0 is open in g , the set {0} is open in Ker exp. However, transla¬ 
tions are homeomorphisms so that {a} is open for every a e Ker exp. Thus 
since any set W of Ker exp is of the form W — \J aeW { a }, W is open; that is, 
Ker exp is discrete. 

Theorem 6.20 Let G be a connected commutative Lie group of dimension 
n. Then there exists an integer p, 0 <p < «, so that G is isomorphic to 
R n ~ p x T p . 


Proof Since exp X exp Y = exp(A" + Y ) and the connected group G is 
generated by a neighborhood U = exp U 0 , we see that the homomorphism 
exp : g G is surjective (also Section 5.3). From Corollary 6.16 (or directly), 
we have G and #/Ker exp are isomorphic as Lie groups. However, regarding 
g as the additive group R n , we have from the preceding results Ker exp is 
isomorphic to some Z p . Thus G is isomorphic to R rt /Z p = (R n ~ p x R p )/Z p 
which gives the results since T p is isomorphic to R P IZ P extending example 
(1), Section 3.2. 

Exercise (1) The results outlined below will be used in the chapter on 
solvable Lie groups. 

(i) Let G be a commutative group with + as its operation. Then G is 
called divisible if for any integer neZ and any x e G we have x e nG = 
{ny :y eG). Show if G and H are commutative divisible groups, then the 
direct sum G ® H ^ G x H is also divisible. Show that the additive groups 
R and T 1 = RjZ are divisible. Thus R q x T p is divisible. 

(ii) Let H be a subgroup of the commutative group G. Then H is a 
divisible subgroup if H is divisible as a group. Show that if H is a divisible 
subgroup of a commutative group G , then there exists a subgroup K of G 
such that G = H® K. (This is not too easy [MacLane, 1963; Barns, 1965].) 

(iii) Now let G be a commutative Lie group (written additively). Then 
we know that the identity component G 0 = R q x T p is divisible. Therefore we 
have an exact sequence 

0-► G 0 —!—+ G G/G 0 -> 0 

which splits since G = G 0 ® K. Because of this G —► G/G 0 —► 0 splits and 
therefore there exists a Lie group homomorphism/: G/G 0 -► G with n °/the 
identity on G/G 0 . Using this and the fact that G/G 0 is discrete (Theorem 
3.22), show that K is a discrete Lie subgroup of G. Thus we have shown that 
if G is a commutative Lie group (written additively), then G = G 0 ® K where 
the connected component G 0 ^R q x T p is a divisible subgroup and K is a 
discrete subgroup. 




CHAPTER 7 

AUTOMORPHISMS AND ADJOINTS 


We have already considered some results on automorphisms of Lie groups; 
for example, an automorphism 4>: G-+G induces an automorphism T(f>(e) : 
g g of the corresponding Lie algebra. We now develop some of the “ struc¬ 
tural” results for groups of automorphisms. Thus we first show that the 
automorphism group of a nonassociative algebra A is a Lie group whose Lie 
algebra is the derivation algebra of A. Next we develop the concept of inner 
derivations of a nonassociative algebra and the corresponding inner auto¬ 
morphisms. When the algebra is associative these reduce to the usual concepts 
of inner derivations and automorphisms. Using these results we use the differ¬ 
ential of an inner automorphism <j>(g) : G -► G : x -► axa~ l of a Lie group G to 
obtain an automorphism (T<£(a))(e) : g -> g of the corresponding Lie algebra g. 
Then we obtain a mapping Ad : G GL{g) : a -* ( T<j>(a))(e ) called the “ adjoint 
representation of G.” We develop formulas for the adjoint representation 
which lead to the result that the inner automorphism group of a connected 
Lie group G equals Ad(G). The fact that for a connected Lie group G, Aut(G) 
is a Lie group is proved in Chapter 8. 
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1. Automorphisms of Algebras 


Let A be a finite-dimensional nonassociative algebra over a field K with 
bilinear multiplication function a and define <x(X 9 Y) = X • Y = XY. Let 
Aut(/1) denote the automorphism group of A ; that is, the set of maps 
(j) e GL(A ) with <p(XY ) = 0(A)0(Y). Then Aut(^) is a subgroup of GL(A). 

Proposition 7.1 Let A be a nonassociative algebra over R . Then Aut(^4) is 
a closed subgroup of GL(A ). Thus Aut(^4) is a closed Lie subgroup of GL{A). 

Proof For X , Y fixed in A we note that since the multiplication in A is 
continuous (it is a bilinear map of the finite-dimensional space A), the set 
S(X 9 Y) = {<t>e GL(A) : <f>(XY) = </>(*)</>(T)} is closed. Thus Aut(>4) = 
0 {S(X, Y):X 9 Ye A) is closed in GL(A). 

Definition 7.2 A derivation D of a nonassociative algebra A is a linear 
transformation of A satisfying D(X • Y) = DX • Y + X • D Y for all X, Ye A. 

Let Der(^4) denote the set of derivations of A. Then it is easy to see that 
Der(v4) is a Lie subalgebra of gl(A). 

Proposition 7.3 Let D be a derivation of a nonassociative algebra A over 
R. Then: 

(a) exp tD e Aut(^) for all t e R; 

(b) the Lie algebra of Aut(^4) equals Der (^4). 

Proof Since Aut(^4) is a closed subgroup of GL(A ), (a) follows from (b) 
and Theorem 6.9. So to show (b), let D be in the Lie algebra of Aut(^4). Then 
from Theorem 6.9, we have exp tD e Aut(^4) for all t e R. Therefore for 
X ; Ye A 

(exp tD){X • Y) = (exp tD)X • (exp tD) Y 

and using the product rule, we differentiate with respect to t at t = 0 to obtain 
D(X • Y) = DX ‘ Y + X • DY\ that is, D e Der(^). To show the converse, we 
let D be a derivation and let 

a(0 - (exp tD)(X Y) and j8(f) = (exp tD)X • (exp tD) Y. 

Then we see a(0) = fi(0) = ATand d(0 = Doc(t) in A. Also using D as a deriva¬ 
tion we have 

ft(t) = D(exp tD)X * (exp tD) Y + (exp tD)X • D(exp tD) Y 
= Z)[(exp tD)X* (exp tD)Y] = Df}(t). 

Thus a and P are solutions to z = Dz satisfying the same initial conditions. 
This implies the desired result <x(t) = fi(t) using Proposition 2.36. 
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2. Inner Derivations and Automorphisms 

Recall that for a nonassociative algebra A over R we have defined the 
mappings L(X) : A -> A : Y -► XY and R{X) : A A : Y -► Y J7 for all e ^ . 
In particular, if A = g a Lie algebra, we have set 

L(X) = ad(A) 

called the adjoint mapping in g. 

Definition 7.4 Let A be a nonassociative algebra over R and let P be the 
subspace of End(A) spanned by L(X) and R(Y) for all X 9 Ye A. The Lie 
transformation algebra of A, denoted by L(A), is the Lie subalgebra of g/(A ) 
generated by P [Schafer, 1966]. 

Examples (1) If A is a Lie algebra with multiplication [17], then 
= ad(X) = — R(X) so that P = {ad(20: X e A}. Using the Jacobi 
identity, we see that 

[ad X, ad Y] = ad ([XY]) 

so that P is a Lie algebra of linear transformations; that is, L(A) = P 
(2) If A is associative, then we have 

L(X)L(Y)Z = X( YZ) = (XY)Z = L(XY)Z and L(X)L(Y) = L(XY); 

R(X)R(Y)Z = (ZY)X = R(YX)Z and R(X)R(Y) = R(YX ); 

L(X)R(Y)Z = X(Z Y) = R(Y)L(X)Z and L(X)R(Y) = R(Y)L(X ). 

Thus in this case P is also closed under commutation; that is, L{A) = P . We 
shall give examples later when L{A) is more complicated. 

Definition 7.5 A derivation D of a nonassociative algebra A is inner if 
D e L(A). Let Inn(^4) denote the set of inner derivations of A. 

Examples (3) If A i s a Lie algebra, then for any X e A we have D = ad(X) 
is an inner derivation. 

(4) Let A be associative, and let D = L(X} + R(T) be an inner deriva¬ 
tion. If A has an identity element 1, then 

D(l) = D( 1 * 1) = D( 1) • 1 + 1 • D( 1) 

so that 0 = Z)(l) = ( L(X ) + i?(Y))(l) = X + Y, Thus for an associative 
algebra with identity, inner derivations are of the form D — L(X ) — 7?(X). 
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Proposition 7.6 The set Inn(^f) of inner derivations of a nonassociative 
algebra A is an ideal in the Lie algebra Der(/(). 

Proof We first find a formula for L(A) as follows. Let I\ = P be as in the 
definition of L(A) and let P i+1 = [Pi, PJ. Then we have 

[Pi,Pm\^Pi+m for i, tn = 1 , 2 ,.... 

The case i = 1 follows from the definition. Now assume the results for i = k 
and all m. Then we use the Jacobi identity in gl(A) as follows for i = k + 1. 

[Pk+fPJ = [pP lf PJ,P m ] 

<=[[PuP m lP*] + [Pu[Pk,P m ]] 
c [Pm + X>Pk] + [Pu Pk + m\ 

c Pk + m+ 1) 

where we use the induction hypothesis on the third containment. Thus 
££ x P t is a Lie subalgebra of L(A) which contains P so that by definition we 
have 

L(A) = f j P i . 

i= 1 

Next we note by induction that if D e Der(^l), then [D, PJ <= P t . For 
i=l, this uses the formulas 

[Z>, L(X)] = L(DX) and [ D , R(X )] = P(Z)X) 

which follow from the definition of a derivation. For i > 1, we use the Jacobi 
identity again. Now since L{A) = £ Pi > this yields 

[Der(A L04)] c= L(v4). 

Finally, since Inn(^4) = Der(i4) n L(A) is a subspace of Der(A), we see 
[Inn(^4), Der(^)] <=.L{A) n Der(^) = Inn(/1); 
that is, Inn(yl) is an ideal in Der(^). 

Definition 7.7 An automorphism </> of A is inner if <j) is contained in the 
subgroup of Aut(yl) generated by exp(Inn(^4)). Thus <j> = exp L>i * * * exp D n 
where D t e Inn (A). Let Int(^) denote the subgroup of inner automorphisms 
of A, 

Example (5) Let i be a finite-dimensional associative algebra of 
endomorphism s containing the identity I. Let D = L(X) — R(X) e Inn (A). 
Then since L(X)R(X) = R(X)L(X) we have 

exp D = exp (L(X) - R(X)) 

= exp L(X) exp(— R(X)) = exp L(X) exp R( — X) 
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using exp (S + T) ~ exp S exp T if ST = TS . However, [exp L(X)](Z) = 
(X L(X) n /n !)Z = (^X n /n !)Z = (exp X)Z and similarly, [exp R(X)]Z = 
Z(exp X). Thus we obtain 

(exp D)Z = [exp L(X) exp R(-X)]Z 

= (exp 20Z(exp — X) = (exp X)Z(cx p 20“* 

which conforms to the usual definition of an inner automorphism as an 
endomorphism (p e GL(A) of the form <p(Z ) = UZU' 1 for some (7 e vL 

Later we shall give a criterion which shows that for many simple algebras, 
all derivations are inner. 


3. Adjoint Representations 


We now consider the differential of inner automorphisms to obtain a 
representation of a Lie group G in its Lie algebra g . 

Definition 7.8 Let V be a finite-dimensional vector space over a field K\ 
let G be a group, let g be a Lie algebra over K , and let A be an associative 
algebra over K, Then a group representation of G in V is a group homomor¬ 
phism G->GL( V). A Lie algebra representation of g in V is a Lie algebra 
homomorphism g->gl(V). An associative algebra representation of A in V 
is an associative algebra homomorphism A -> End (V). An injective repre¬ 
sentation is called faithful. 

Examples (1) If g is a Lie algebra over R and if ad {g) ~ (ad X : X € g}, 
then ad : g -► ad (g ): X ad X is a Lie algebra representation called the 
adjoint representation. For in this case we use the Jacobi identity to obtain 
a d[XY] = [ad X , ad Y] so that the map ad is a homomorphism. Next note 
that ker(ad) = {Z eg : [XZ] = 0 for all X e g} is the center Z(g) of g. Con¬ 
sequently the Lie algebras ad(#) and g/Z(g) are isomorphic. Thus if Z(g) — {0} 
then# is isomorphic to a Lie algebra of linear transformations. 

(2) If A is an associative algebra, then the map A -► End(^4) : X -> L(X) 
is a representation which is faithful if A has an identity element. 

(3) Let G be a connected Lie group, and let <p e Aut(G) be an analytic 
automorphism. Then T<p(e) e GL(g\ and the map 


Aut(G) - GL(g) : <j> T<p(e) 
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is a faithful representation of the subgroup of analytic automorphisms of 
Aut(G) in g. [Note Theorem 8.14 for the Lie structure of Aut(G).] To see this 
is a representation, we note for analytic automorphisms </>, \j/ e Aut(G), 

T((j> o xj/){e) = ° Tij/(e) = T<t>(e ) o Txj,(e). 

This representation is faithful; for if ( T(j)(e))(X ) = I(X) = X for all X eg, 
then by Proposition 5.19, 

</>(exP X) — exp(T</>(e)(X)) = exp X . 

Thus (j) = idy on a neighborhood U of e in G and since G is connected, G is 
generated by U. Thus since <j> is an automorphism, it is the identity on G (or 
we can use Proposition 5.20). Since continuous automorphisms of G are 
actually analytic, this example extends to a representation of the group of 
continuous automorphisms of G . 

Definition 7.9 Let G be a Lie group with Lie algebra g and for aeG 
let <j)(g) : G->G : x-^axa' 1 be the corresponding (analytic) inner auto¬ 
morphism. Then the mapping 

Ad : G - GL(g) : a -» (T^(a))(e) 

is called the adjoint representation of G. 

Proposition 7.10 The adjoint representation is an analytic homomor¬ 
phism of G into the Lie group Aut(#). 

Proof Since 0(a) is an analytic automorphism of G, (T0(a))(e) is an 
automorphism of g and 

T<t>(ab)(e) =(T(4>(d) o cf>(b)))(e) = T<t>(a)(e) o T<j>(b)(e). 

Thus Ad (ab) = Ad(a) *> Ad(6) so that Ad is a representation of G. To show 
that it is analytic, let X u ..., X m be a basis of g and let {x u .. . , xj be the 
corresponding canonical coordinate system. Now let 

Ad(a)(*,.) = [T<t>(a)(e)](X t) = £ a ji {a)X j 

j 

be the matrix representation of Ad (a). If x — exp tX i9 then from Proposition 
5.19 

<t>(a)x = <j>(a)(ex p tX t ) 

= exp [t T<f>(a)(e)(X j)] = exp[f £ a s fyi)Xj]. 

j 

Thus for t near enough to 0 e R, taj^a) are the canonical coordinates of <p(a)x. 
However, the canonical coordinates for <j>(a)x are given by commutator 
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formula in Theorem 5.16(b) and are analytic for a in a suitable neighborhood 
of e in G . Thus Ad is analytic at e e G and since it is a homomorphism of Lie 
groups, Ad is analytic on all of G. 

We shall now derive some formulas for Ad. 

Theorem 7.11 Let Ad : G -* Aut(^) be the adjoint representation of a Lie 
group G . Then 

(T Ad)(e) = ad; 

that is, the differential at e of the adjoint representation of the Lie group is the 
adjoint representation of the Lie algebra. 

Proof Since Ad : G Aut(^) we have TAd(e) : T(G , e) -> T(Aut(#), I ). 
Thus for X eg = T(G , e) we have the value X* = (TAd)(e)X e T(Aut(g), /) = 
Der(< 7 ); X * e End(^). Since Ad is a homomorphism, we have for t near 

0 in R 

(Ad)(exp tX) = exp[f(TAd)(e)2f] = exp tX*. (1) 

Thus since Ad(exp tX) e End(^), we have for Y e g and from the series exp tX* 
for the linear transformation X *, 

X*(Y ) = lim - (exp tX* - I)(Y) 

t^o t 

= lim - (Ad(exp tX) - /)(7). (2) 

r-»o t 

Now for s and t sufficiently near 0 in R we have 

exp [5 Ad(exp tX)Y] = Qxp[s(T(j)(Qxp tZ)(e))(7)] 

= ^(exp t2Q[expCyy)] 

= (exp tX)(exp ,yy)(exp — tX) 

= exp^y + st[XY] + so(t 2 )) 

using the definition of Ad, the fact a(exp X) = exp (Ta(e)X) for an analytic 
automorphism a of G, and Theorem 5.16(b). Therefore we can conclude for t 
near 0 in R 

Ad(exp tX)Y = y + t[XY] 4 - o(t 2 ). (3) 

Thus substituting (3) into (2) we obtain 

X*(Y) = lim \ (y + t[XY] - Y) = [XY]; 

t^o t 

that is, [T Ad(e)](Z) = X* = ad X. 
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Corollary 7.12 For X e g and a e G we have 

(a) Ad(exp X) = exp(ad X) (= e adX ) 

(b) a(exp X)a~ l = exp[Ad(a)(X)]. 

Proof Part (a) is just a restatement of formula (1) in the above proof. 
For (b) we have as in the above proof that for the inner automorphism 
4>{a) : G -* G : x -*• axa~ r 9 

a(exp A)a _1 = <£(a)(exp .X) 

« exp[(T0(a)(£f))(X)] = exp[Ad (a)(X)] 

using the definition of Ad. 

Corollary 7.13 If G is a connected Lie group with Lie algebra g, then 
Ad((7) = {Ad a : a e G} is a Lie group with Lie algebra ad(#). Thus Int(<?) the 
inner automorphism group of g equals Ad(G). 

Proof From Proposition 6.11, we have for the homomorphism /= Ad 
that/(C) is a Lie group [a Lie subgroup of GL(g)]. Also the Lie algebra of 
Ad(G) equals if(/(G)) = J 2 f(Im /) = Im(T/(e)) = Im(ad) = ad (g). 

Next if (j) e Int(^), then 4> = exp D 1 * * * exp D m where D t = ad X t . How¬ 
ever, exp(ad X t ) — Ad(exp so that (j) = Ad(exp X t * * * exp X m ). Since G is 
connected every element is of the form exp X t * • • exp X m and the results now 
follow. 

Corollary 7.14 Let C be a connected Lie group with Lie algebra g and let 
Z(G) be the center of C. Then 

(a) Z(G) is a Lie group and its Lie algebra is the center of g. 

(b) The kernel of the analytic homomorphism Ad : C -► Int(^) is Z(G) 
and C/Z(C) = Int (g) = Ad(G) as Lie groups. Thus if Z(G) = {e} 9 then 
Ad : C -► Int(^) is a Lie group isomorphism. 

(c) If the center of g is {0}, then the center of Int(#) is {/}. 

Proof Since Z(G) = {b e G : bxb~ l = x: for all jcgG} we see from con¬ 
nectedness of G that b e Z(G) if and only if 6(exp X) = (exp X)b for all 
X eg. Thus using Corollary 7.12(b) 

exp X = Z>(exp X)b~ x = exp[Ad(6)(A)]. 

This is the case if and only if 

Ad (b) = I 

which is the case if and only if b e Ker Ad. Thus we see Z(G) = Ad _1 (/) 
which is a closed Lie group by Proposition 6.11. Thus Z(G) = Ker(Ad) so that 
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by Proposition 6.11, the Lie algebra J?(Z(G)) = ker(TAd(e)) = ker(ad) = 
(Z e g : ad Z = 0} which is the center of g. The rest of (b) follows from the 
isomorphism theorem (Corollary 6.16) and (c) is an exercise. 

We shall now consider ideals and normal subgroups in more detail and 
extend the results of Proposition 6.15. 

Lemma 7.15 Let G be a Lie group with Lie algebra g, let V be a finite¬ 
dimensional vector space over R, and let ij/ : G ->GL( V) be an analytic map. 

(a) If : G —► GL(V) is a representation of G in V, then T\Jj{e ): g —> gl{V) 
is a representation of g in V. 

(b) If ip is a representation and IP is a subspace of V with <j/(a) W <= W 
for all a e G, then [TiP(e)(X)](W) c W for all X e g. 

(c) Conversely, if G is connected and \j/ is a representation and IP is a 
subspace of V with [T\J/(e)(X)](W) <= W for all X eg, then \j/(a) W <= IP for all 
aeG. 

Proof Part (a) is just a restatement of Proposition 5.19. For (b), let 
Xeg and weW. Then since [Tij/(e)](X) is a linear transformation and 
iA(exp tX) = exp[tTijj(e)(X)] we have 

[T<p(e)(X)]w = [d/dt exp t(T\J/(e)(X))\ t=0 ]w 
= d/dt[(exp tT<l/(e)(X))w] t=0 
- d/dt[\//(exp tX)w] l=0 . 

Thus if for all aeG we have t l/(a)w e IP, then [Tij/(e)(X)]w e IP. This is the 
case because the difference quotient for the derivative and its limit are in IP. 
Conversely for (c) let [Tijj(e)(X)]w e W. Then using the formula expZ = 
£ z ”/«! for a linear transformation Z we have t/r(exp tX)w = exp (tTij/(e)(X))w 
is in IP. Thus >j/(U)w e IP for some neighborhood U of e in G and since G is 
connected and t// is a homomorphism tj/(a)w e IP for any aeG. 

Corollary 7.16 Let G be a connected Lie group with Lie algebra g. 

(a) If H is a normal Lie subgroup of G, then its Lie algebra h is an ideal 
of g. 

(b) If h is an ideal of g, then the connected subgroup H generated by 
exp A is a normal subgroup of G. 

Proof Let \jj = Ad with V = g in the preceding results. Then for 
4>{a)x = axa 1 we have T<j>(a)(e ) = \jj{a). Thus if H is normal, then for any 
aeG'fit have <f>{a)H = aHa~ 1 c H so that, by Corollary 7.12, 

[T<fi(a)(e)](h) c= h; 
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that is, Ad (a)h c h. Thus for i// = Ad we have ad = T\j/(e ) so that by Lemma 
7.15 for all Xeg, 

(ad X)(h) = [Xh] <= h. 

Thus h is an ideal of g. 

Conversely if h is an ideal of g, then ad (g)h = [gh\ <= h. Thus for ij/ = Ad we 
have Ti//(e ) = ad is such that for any Xeg, 

[Tip(e)(X)]h = [Xh] c h. 

Therefore by Lemma 7.15(c), i//(a)h <= h ; that is, Ad(a)/r <= h. Consequently, 

a ■ exp(A) ■ a~ l = </>(<z)(exp h ) 

= exp[T<f>(a)(e)(h)] 

= exp[Ad(a)(/;)] c: exp h. 

Thus since the subgroup H generated by exp h is connected and <t>(a) is a 
homomorphism, H is normal. 

Exercises (1) If G is a connected Lie group with Lie algebra g, then 
Ad G is a normal subgroup of (Aut g) 0 (the identity component of Aut g). 

(2) Use Theorem 5.16 to deduce Corollary 7.16(a). 

Example (4) Let G be a connected Lie group with Lie algebra g and let 
H be a closed Lie subgroup with Lie algebra h. The pair (G, H) or (g, h ) is 
called a reductive pair if in the Lie algebra g there exists a subspace m such 
that g = m + h (subspace direct sum) and (Ad H)(m ) tz m. In this case the 
corresponding homogeneous space G/H is called a reductive homogeneous 
space. For the (fixed) decomposition ^ = m + Awecan introduce an algebra 
multiplication in m as follows: For X,Yem let [XY] = XYh{X, Y) 
where XY = [XY] m (respectively h(X, Y) = [AF] a ) is the projection of [XT] 
in g into m (respectively h). Thus m with the multiplication XY becomes an 
anticommutative algebra; that is, XY = — YX. This algebra is analogous to 
the Lie algebra of a Lie group and can frequently be used to obtain informa¬ 
tion about the space G/H. 

Exercise (3) Show that the Lie algebra identities of g yield the following 
identities for the above algebra m and the decomposition g = m + h. 

(i) XY = — YX (bilinear); 

(ii) h(X, Y) = —h(Y, X) (bilinear); 

(iii) [Zh(X, Y)] + [Xh( Y, Z)] + [ Yh(Z, X)] = (XY)Z + (YZ)X 
+ (ZX)Y; 

(iv) h(X Y, Z)+ h( YZ, Y) + h(ZX, Y) = 0; 

(v) [Ph(X, Y)] = h([PX], Y) + h(X, [FY]); 

(vi) [P XY] = [PX] Y + X[PY] 
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for all X, Y,Z em and P eh. In particular (vi) says that the mapping D(P) = 
ad m P : m-+m: X -► [PX ] is a derivation of the algebra m for each P eh. 

(4) Show that Ad H c= Aut(>w) where m is the above algebra. 

As an example consider G = SO(ri) and H — SO(p) for p < n as given in 
Section 6.4. Then relative to the decomposition g = m + h of Section 6.4, one 
obtains (G, H ) or (g f h) as a reductive pair (exercise). Note that the following 
situation arises: For p < n - 1, m is not the zero algebra; that is, XY^ 0. 
However, for p = n - 1 we have m is the zero algebra; that is, XY ~ 0. In 
general the pair ((7, H) or (,g , h) is called a symmetric pair if there exists a 
subspace m of g with g = m + h (direct sum) and Ad (H)m cz m and [mm] cz h . 
The corresponding homogeneous space GjH is called a symmetric space 
[Helgason, 1962; Loos, 1969]. 

Exercise (5) For n — 2p what can be said about the pair ((7, H) for 
G = GL(n , R) and H = Sp(p, R) ? 



CHAPTER 8 

SIMPLY CONNECTED LIE GROUPS 


In this chapter we review some basic facts on homotopy, fundamental groups, 
and covering spaces and apply these results to simply connected Lie groups. 
For example we show that if G is a simply connected Lie group, if is a Lie 
group, and /: g -► h is a homomorphism of the corresponding Lie algebras, 
then there is a unique Lie group homomorphism ij/ : G -► H with Txl/(e) = f. 
In particular, this implies simply connected Lie groups can be “classified” 
by their Lie algebras; that is, if G and H are simply connected Lie groups with 
isomorphic Lie algebras, then G and H are isomorphic Lie groups. This 
classification is nonvacuous since we show for a given Lie group G with Lie 
algebra g , there exists a simply connected Lie group G with Lie algebra iso¬ 
morphic to g. Finally we use various results to show that if G is a connected 
Lie group, then Aut(G) is a Lie group. 


1. Homotopy Review 


In this section we briefly discuss the basics of homotopy, the fundamental 
group, and show that the fundamental group of a Lie group is commutative. 

Definitions 8.1 Let M and N denote (Hausdorff) topological spaces and 
let / denote the closed interval [0, 1]. 

(a) Let / 0 and f x denote continuous maps M -> N. Then f 0 is homotopic 
to fi (denoted by / 0 - / x ) if there is a continuous map 

h : M x I ->N:(x 9 t)-+ h(x , t ) 
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satisfying h(x , 0) = f 0 (x ) and h(x , 1) = /i(x) for all x e M. 

(b) Let x 0 , jcj eM. Then a path in M from x 0 to x t is a continuous map 
(curve) a:/->Mso that a(0) = x 0 and a(l) = x x . The end points of a are x 0 
and x x and a is a closed path in case x 0 = x t . 

(c) Let a and ft be paths from x 0 to x x in M. Then a and ft are homotopic 
or equivalent (also denoted by a ~ /?) if there is a continuous map 
h : I x / -► M : (f, s) -> h(t , s') satisfying 

(i) A(0, j) = jc 0 and A(l, = jcj for all s e /; 

(ii) /*(t, 0) = a(t) and h(t 9 1) = f$(t) for all t e /. 

Thus a and ft are homotopic as functions with the additional restriction that 
the end points are fixed throughout the homotopy. The function h is called 
a homotopy of a to /?. 


Remark (1) The homotopy of paths is an equivalence relation. Thus 
for a path a with endpoints x 0 and x x we let [a] denote the equivalence class 
of a; that is, [a] is the set of all paths homotopic to a. 


Definitions 8.2 Let M be a topological space and let x 0 , x x x 2 , e M. 


(a) Let a be a path from x 0 to x 1 and let j8 be a path from x x to x 2 . Then 
the product of a and is the path a : I -> M given by 


OtfXO = 


ja(2t) 
\P(2t - 1) 


for 0 <t<\ 
for i < t < 1. 


(b) The inverse of a path a from x 0 to x 1 is a path a 1 from Xj to x 0 
given by 


a -1 (t) = a(l — t). 


Remarks (2) The product of two paths is continuous and so is the 
inverse. 

(3) Let a 0 ~ <x x and p 0 ~ p l be paths. Then: 

(i) «o 1 ~ wf 1 * 

(ii) if olqPo is defined, then o i 1 fi 1 is defined and oc o p o ~ . 

This allows us to define the product and inverse of equivalence classes by 
[ol][P] = [<xp] if oc/3 is defined 


and 


[a]" 1 = [a* 1 ]. 
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(4) For any z in M let e z : /-► M : t -> z. Then e z is a path in M with both 
endpoints equal to z. Let a be a path from * 0 to x t . Then we have 

fcr.]M = M = [a][e Xl l 

These considerations lead to the following result; see the work of Singer and 
Thorpe [1967] for various proofs. 

Proposition 8.3 Let M be a topological space and let x 0 e M. 

(a) The set of equivalence classes of closed paths with endpoints x 0 form 
a group under the operations of [a][/l] = [a/?], [a] -1 = [a -1 ], and identity 
[e Xo ] as above. This group is called the fundamental group of M relative to the 
base point x 0 and denoted by n t (M 9 x 0 ). 

(b) If M is path wise connected, then for any x 0 , x x e M we have n x (M, x 0 ) 
and n^M, are isomorphic groups. Thus in this case the fundamental 
group is essentially independent of the base point and we frequently write 
n x (M) for n x (M, x 0 ) and call n x {M) the fundamental group of M. 

(c) If /: M -► N is a homeomorphism of topological spaces, then the 

map/* : x 0 ) -► n t (N,f(x 0 )) : [a] -► [fo a] is an isomorphism of funda¬ 

mental groups. 

Remark (5) Let G be a connected Lie group. Then, since it is a mani¬ 
fold, G is also path wise connected. Thus the fundamental group of G , n^G), 
is isomorphic to n^G, e). 

Exercises (1) Let G and H be connected Lie groups. Show n^G x H) 
is isomorphic to n^G) x n x {H) as product groups. 

(2) Using Theorem 6.20, show that the fundamental group of a con¬ 
nected commutative Lie group is isomorphic to Z x • • • x Z, p-times, for 
some integer p. [Recall that n x (R) ^ {0} and n^S 1 ) = Z.] 

The following result shows that the fundamental group of a connected 
Lie group is Abelian. 

Proposition 8.4 Let M be a topological space with a continuous “multi¬ 
plication” function /x:MxM->M such that there exists eeM with 
p(x,e) = p{e, x) = x for all x e M. Then n x {M, e ) is Abelian and the pro- 
duct [x][fi] = [n o (a, /?)] in n t (M, e ) where n ° (a, p)(t) = n(<x(t), P(t)). 

Proof Let [a], [ fi ] be in n x (M, e ) and let /: I -► M be a closed path which 
is a representative of [a] and similarly let g : I -> M be a representative of [/?]. 
Since p(e, e) = e we can define a closed path k : I -> M with endpoints e by 
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for t 6/. Then k represents an element [y] in n x (M 9 e ) and [y] depends only 
on [a] and [/?]. We shall show [ot][p] = [y] = [/?][oe] and this also shows the 
formula [a][/?] = [p o (a, ft)]. 

First to prove [y] — [<x][fi] we can assume that the representative paths / 
and g satisfy f(t) = e for \ < t and g{t) = e for t <\. Then using e as a two- 
sided identity we have 


k(t) = 


fit) 

9(t) 


for 0 < t < 
for \ < t < 1. 


Now we define a continuous map h : / x /-* M : (t, s) /?(t, 5 ) by 

hd a = /AC® + 00 for 0 ^ h 0 < s < \ 9 

' \g((t — l)s + t) for \<t< 1, 0<s<\ 9 

and note /z(0, s ) = A(l, 5 ) = e for all ^ e /. Also since /*(r, 0) = k(t) and since 
h(t, 1) = (fg)(t), the product fg, we see that k is homotopic to fg; that is, 

M = WIW- 

To show [y] = [fi][<x] we first choose the representatives / and g so that 
/(0 = e for t < i and g(t) = ^ for \ < t. Then from k(t) = ju(/(r), ^(/)) we see 
[y] is represented by 


9(f) 0 < t < 

fit) i<t< 1 


However, as above, k is homotopic to the product gf\ that is, [y] = [/?][oe]. 


Definition 8.5 A topological space M is simply connected if it is pathwise 
connected and its fundamental group n Y {M) consists of the identity element. 
The space M is locally simply connected if for each p e M and each neighbor¬ 
hood U of p there is a simply connected neighborhood V with pe V cz U. 


Theorem 8.6 Let G be a simply connected Lie group with Lie algebra g 
and let H be a Lie group with Lie algebra h. Iff: g h is a Lie algebra homo¬ 
morphism, then there exists a unique Lie group homomorphism \j/: G H 
such that T\j/(e) =f 


Proof Since G is simply connected, it is connected, and consequently by 
Proposition 5.20 if \j/ exists, it is unique. Furthermore from Theorem 6.8 there 
is a local homomorphism <f >: G -> H with T<j){e) = /. We shall show that we 
can extend </> to the desired homomorphism \js with <f> — \j/ on a suitable nucleus 
in G. We follow the proof of Hausner and Schwartz [1968]. 

Thus let U be a connected symmetric nucleus in G so that <j)(xy) — 0(x)0(y) 
for x 9 y e U and also let V be a connected nucleus in G with Kc(/ and so 
that For a path <x:I^>G: t-+ a (t) with endpoints x 0 and x 1 we 
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define a fine partition of I relative to the path a (and U) as follows. Let 
0 = * 0 < fj < • • • < = 1 be a partition of / so that for all subintervals 

I k = t k ] for k = 1 ,..n we have 

s,tel k implies <x(s)~ 1 cc(t) e U. 

Such a partition is called fine. A fine partition exists as follows. Since / is 
compact and since the group operations are continuous we have that there 
exists a <5 > 0 so that if \s - t \ < <5, then a(5) -1 a(0 e U. Also note that a 
refinement of a fine partition is fine. 

For a fixed fine partition as above set A*a = a _1 (4-i) a (^) and define 

F(<x) = </>0M)<£(A 2 a) * • • </)(A n a). 

We now show that F(cc) does not depend on the partition of /but depends only 
on the equivalence class of the path a. First we show that if a point p is added 
to the above partition, then the value of F is unchanged. Thus we add p to 
the partition and obtain a new fine partition by considering the interval 
[t k - 1 , t k ] being replaced by [t k - u p]v [ p , **]. Then 

A*a = a” 1 (f k _ 1 )a(f k ) 

= a" 1 (f fc -i)a(p)a(p)" 1 a(t k ) = A'aA ff a. 

Since A'a and A"a are in U and </> is a local homomorphism on U we have 

<f)(A k a) = ^(A'a^CA'a). 

Thus using this new fine partition we see that the value F(a) remains the same. 
Therefore since any two fine partitions have a common refinement (which is 
fine), we see that F(ot) does not depend on the partition. 

Next F( a) depends only on the equivalence class [a], for let 

h : / x / -► G : (f, s) -► A(/, j) 

be a homotopy with A(f, 0) = /(0 and A(t, 1) = ^(0 where both/and # are in 
[a] (with endpoints * 0 and Now let p e / be fixed and let 0 = f 0 < / <*** 
< = 1 be a fine partition for the path h(t , ^). By continuity of h we can 

choose s el sufficiently close to p so that 


{*0 ! *1. • 

.., t n } is a fine partition for h(t . 


(1) 

h(t k , s)~ 

1 h(t k ,p)eU for 

k = 1, ... 


(2) 

h(t k ,p)~ 

' : /i(4 ,s)eU for 

A: = 1, ... 

, rt. 

(3) 


Briefly, for each t k we can find neighborhood “balls” B k of p in / so that 

h(t k ^ 1 ,ay 1 h(t k ,a)e U 
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for each ae B k . Thus since there are only finitely many balls B k , choose the 
one with the smallest radius B. Then for all s e B we see h(t k ~ l , , s) e U 

which proves (1). For (2) continue making the finitely many necessary choices 
of balls to obtain h(t k , a)~ 1 h(t k ,p)eU and then take the one with the smallest 
radius (or B) to obtain (2). For (3) use the fact that U is symmetric ( U~ l = U). 
We use these equations and induction to obtain for 1 < k < n, 

4>[h{to . p)~ l h(tup)] ■ • • 4>Wfk-up)~ l h{h ,/>)] 

= <f>[h(t 0 ,s)~ 1 h(t i ,s)\ ■■■ s)~ l h(t k , j)] • <j>[h(t k , (4) 

For k = 1 we use </> is a local homomorphism on U and h(t 0 , p) = 
h(t Q ,s) = x 0 , and Eqs. (1) and (2) to obtain 

4>[Kt 0 »= 4>Wo > s) ■ h{t u s)~ l h(t u />)] 

= mt 0 , smw u sy i Kh, p)i 

To pass from k to k + 1 we use <fr as a local homomorphism, Eqs. (l)-(3), and 
multiply both sides of (4) by 

<f>[Ktk,P)~ l Kh+uP)] 

= <!>[h(h,PY l h(tk,s) ■ h(t k ,s)~ 1 h{t kJrU i) • h(t k+1 ,5)~ 1 h(t k+1 ,p)] 

= (plKtk'Py'Kh,*)] • 4>lKh, s)~ l h(t k+1 , 5)1 • <p[h(t k+1 , s)~ 1 h(t k+1 ,p)l 
Thus for k = n in (4) we use h(t n , p) — h(t n , s) — x x to obtain 

F(h(t, p)) = F(h(t, sj) 

for s sufficiently near p in L Since p is arbitrary in I we can use the transitivity 
of the homotopy of paths to conclude that if/, g e [a], then F(f) = F(g) = F(a). 

Now let x 0 and x t be arbitrary elements in the simply connected group G. 
Since G is pathwise connected, there exists a path a : G from x 0 to x x and 

any two such curves are homotopic since G is simply connected. Thus we can 
set 

F(x o, x t ) = F(cc) 

because F(oc) depends only on the equivalence class [a]. In particular for 
x 0 = e and x t = x arbitrary in G we define the desired homomorphism by 

^ : G -► H : x -► F(e , x). 

First we shall show \j/ equals </> on a connected nucleus V of G where V <= U 
and V~ X V cz U. For choosing such a neighborhood V we have for x e V and 
a a path entirely in V which joins e to x that 0 = t o <t 1 = lisa fine partition 
of / and 

\j/(x) = <£[a(< o ) -1 «0i)] = 0(a(l)) = 4>{x). 

Next we prove a few more formulas to show ^ is a homomorphism. 

F(x, xy) = F(e, y ) for x,yeG, 


(5) 
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for let a join e to y. Then using (xa) 1 (xb) = a l b we have 
F(e, y) = F(a) 

= <f>(A 1 cc)<p(A 2 a) • • • 0(A„a) 

= 0[a(/ o ) -1 a(ti)] •" 0[a(f„_i) _1 a(r„)] 

= <f>[(x<x(to))~\xa(ti))] •' • 0 [(xa(t„_i)) _ 1 (jca(t„))] 

= F(xoc) = F(x, xy) 

using the independence of the path for the last equality. 

Next let a join eto x and let ft join x to xy. Then F( oe/?) = F(a)F(P) where 
a/? is the product of paths (Definition 8.2). This follows because the interval 
may be finely partitioned so that the end point of a and the starting point of 
P are in the partition. Thus since F depends on the equivalence class of a 
path, F(ap) = F(cc)F(fi) gives 

F(e, xy) = F(e, x)F(x, xy) for x, ye G. (6) 

Thus for x 9 yeG we have 

\J/(xy) = F(e, xy) 

= F(e , x)F(x , xy\ using (6) 

= F(e 9 x)F(e, y) 9 using (5) 

- \j/(x)il/(y). 

Thus ^ is a homomorphism which is clearly continuous and consequently 
analytic. 

From this result we see that simply connected Lie groups can be “ classi¬ 
fied ” by their Lie algebras as follows. 

Corollary 8.7 Let G and H be simply connected Lie groups with Lie 
algebras g and h . If g and h are isomorphic Lie algebras, then G and H are 
isomorphic Lie groups. 


2. Simply Connected Covering Groups 


We shall now show that given a Lie group G with Lie algebra g there exists 
a simply connected Lie group G with Lie algebra isomorphic to g. As an 
application, we shall use G to show that Aut(G) is a Lie group for a connected 
Lie group G. 
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Definition 8.8 Let M be a (Hausdorff) topological space. Then: 

(a) M is locaily connected if for each point pe M and each neighborhood 
V of p , there exists a connected neighborhood U of p with U c= V; 

(b) M is locally pathwise connected if for each p e M, every neighborhood 
of p contains a pathwise connected neighborhood of p. 

Exercises (1) Let M be the space in R 2 which is the union of the graph 
sin n/x for xe(0, 1] and a path joining the points (1, 0) and (0, 1). Show that 
M is pathwise connected but not locally pathwise connected. 

(2) Is a manifold locally pathwise connected? 

Definition 8.9 Let M and M be pathwise connected and locally pathwise 
connected spaces and let p : M -> M be continuous. Then the pair (M, p) is a 
covering space of M if: 

(a) p is surjective; 

(b) for each p e M there exists a neighborhood U of p so that /? _1 (C/) is 
a disjoint union of open sets, each of which is mapped homeomorphically 
onto U by p. 

Remark (1) If (M, p) is a covering space of M, then the map p : M -> M 
is an open map; that is, for each open set U of M we have p(U) is open in M 
[Singer and Thorpe, 1967]. 

Examples (1) Let M = S 1 and M = R, and let p : M -> M : x -> e 2nix . 
Then (M, p) is a covering space for M. For each z e M we have that p~ 1 (z) 
consists of infinitely many points. 

(2) Let M = M = S 1 and let p : M -> M : x -► x 2 . Then (M, p) is a 
covering space of M. In this case p~ x {x) consists of two points for each x e M ; 
that is, we have a “double covering.” 

(3) Let M = T 2 (= S 1 x S l ) and let M - R 2 . Define p : M M : 
(x,y) -* ( e 2nlx , e 2niy ). Then (M, p) is a covering of M. 

We shall assume all spaces are pathwise connected and locally pathwise 
connected. For the proofs of the following see the work of Chevalley [1946] 
and Singer and Thorpe [1967]. 

Theorem 8.10 Let (M, p) be a covering space of M and let N be simply 
connected. 

(a) \f f:N-+M is continuous, then there exists a continuous map 
J:N-+M so that f = p °J. 

(b) If /, g : N -> M are continuous maps so that p of = p o g and J(a) = 
g(a ) for some a e N, then / = g. 
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(c) Let a:/-^Mbea path in M so that a(0) = x 0 . Let x 0 e M be such 
that p(x 0 ) = x 0 . Then there exists a unique path a : /-► M so that p ° a = a 
and a(0) = x 0 . 

Theorem 8.11 Let (M, /?) be a covering space of M and let a e M, a e M 
be such that p(a) = a. Then there exists a one-to-one correspondence between 
p~ l {a} and the coset space a)jp^ n x (M, a); recall p^n^M, a) = 

{[p° a]: [a] e n t (M, a)}. 

Theorem 8.12 Let M be a pathwise connected, locally path wise connected, 
and locally simply connected space. Let H be a subgroup of a ). Then 

there is a covering space (M, p) so that p * n t (M, a) — H where a e M is such 
that p{a) = a. In case H — {e} we see M is simply connected; that is, there is 
a covering space (M, /?) with M simply connected. 

Proof Using the above theorems we sketch the construction of (M, p) ; 
for the remaining details, see the proof of Singer and Thorpe [1967]. For 
motivation, note that if (M, p) exists, then each path a in A? starting at a(0) 
is the unique lift of a = p o a in M and also the point a(l) in M is determined 
by [a] = [p o a]. Consequently we are led to construct the points of M from 
paths in M as follows. Let Q be the set of paths in M beginning at the point 
ae M and define an equivalence relation = on Q by a = ft if and only if 
a(l) = p(\) and [a/? -1 ] e H. (For the simply connected covering, this is just 
a ~ /?.) Let <a> denote the equivalence class of a under the relation = and 
let M be the set of all equivalence classes <oc>. We define p : St -► M by 
/;«a)) = a(1) and (M, p) is the desired covering space. 

Theorem 8.13 Let G be a connected Lie group. Then there exists a unique 
simply connected Lie group G which is locally isomorphic to G ; that is, G and 
G have isomorphic Lie algebras. There is a mapping p : G -» G so that ((/, p) 
is a covering space of G and p is a homomorphism and a local isomorphism. 
Also Ker(p) is a discrete subgroup of G which is isomorphic to n^G) and 
Ker(p) is in the center of G . 

Proof The uniqueness up to isomorphism follows from Corollary 8.7. 
For the existence, we let (G, p) be the simply connected covering space of G 
as constructed in Theorem 8.12; the points are the equivalence classes [a] of 
curves a : I^>G with a(0) = e and/?([a]) = a(l). We make G into a group as 
follows. The product is given by [oc][fi] = [y] where y(t) — cc(t)p(t) which is well 
defined since a homotopy of a and a homotopy of /? multiply to give a homo- 
topy of y. The identity is e = [e] where e(t) = e the identity in G. Inverses are 
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given by [a] -1 = [a -1 ] where a _1 (0 = a(f)" 1 . The map p : G -> G is a homo¬ 
morphism because /?([a][j8]) = a(l)/?(l) - /?([a])/?(Lff]). Next note that p maps 
a neighborhood of e in 5 homeomorphically onto a neighborhood of <? in G. 
Consequently the analytic manifold structure on G defines an analytic mani¬ 
fold structure on G with which G becomes a Lie group. Now p : G G is a 
Lie group homomorphism which is a local isomorphism. 

Next since p is a local isomorphism Ker p is a discrete normal subgroup 
of G. Therefore by Proposition 3.25, Ker p is in the center of G. Since G is 
simply connected, Theorem 8.11 implies that Ker/? = p~ l {e) is isomorphic 
to n^G). Note this argument also shows n^G) is Abelian. 

We now use the simply connected covering group to sketch the proof that 
the automorphism group of a connected Lie group is a Lie group; for more 
details see the proofs of Chevalley [1946], Hochschild [1965], and Loos [1969]. 

Theorem 8.14 Let G be a connected Lie group. Then Aut(G) is a Lie 
group. 

Proof First let G be simply connected with Lie algebra g . From Theorem 
7.1 we know Aut(^) is a Lie group. Let a e Aut(#). Then there is a unique 
0 e Aut(G) with (T0)(e) = a (Theorem 8.6). Using the chain rule, the map 
Aut(G) -► Aut(#) : 9 -* a is a group isomorphism. This isomorphism induces 
an analytic structure on Aut(G) where the topology of Aut(G) is given as 
follows [Chevalley, 1946; Hochschild, 1965]. Let K be a compact subset of G 
and let V a neighborhood of e in G. Let N(K, V) denote the set of all elements 
9 in Aut(G) so that 9(x)x ~ 1 and 0 _1 (x)x” 1 are in V for every x in K . Then the 
family of these sets N(K, V ) forms a family of nuclei for the identity of Aut(G). 
Thus in this case Aut(G) is a Lie group. 

Next suppose G is not simply connected and let (5, p) be the simply con¬ 
nected covering group of G. We shall show that Aut(G) is isomorphic to a 
closed subgroup of Aut(5) so that Aut(G) can be regarded as a Lie group 
using Theorem 6.9. Now with N = M = G in the notation of Theorem 8.10 
we have for 6 e Aut(G) that 6 o p : G -> G is continuous. Consequently there 
exists a unique continuous map 8 : G^G with 9 ° p = p o 8 and 8(e) = e. Let 
H = Ker p which is discrete and note that we have the mapping 

G x G -+ H : (x, y) -► 9(xy)d(y)~ 1 8(x)~ 1 , 
for using 6 ° p = p ° 8 and p as a homomorphism we see 
p(8(xy)8(y) " 1 0(x) ~ 1 ) = p8(xy)pS(y) ~ 'pd(x) ~ 1 

= dp(xy)6p(yy 1 dp{x)~ 1 
= 0(p(x)p(y))6p(y) “ l 6p(x) ~ 1 = e 
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using 6 e Aut(G). Therefore since G x G is connected and H is discrete we 
have from Section 2.3 that 8(xy)8(y)~ 1 8(x)~ 1 = e and consequently 8 is an 
analytic homomorphism of G . Since 6 is an automorphism we also easily 
obtain that 8 is an automorphism. Now note for x e 7/ = Ker p that e = 
0(p(*)) = 0/>(*) = />0(x) so that 8(H) c //. Also we have H cz 0(H). Thus 
8(H) = H. Therefore if we let K~{de Aut(S): 8(H) = H) we obtain AT as a 
subgroup of Aut(6) and a homomorphism Aut(G) -► K : 9 -> 8. Also K is a Lie 
subgroup of Aut(S) using H is closed and Theorem 6.9. 

Conversely, let 8 e K 9 let jc e G and let x be any element in G so that 
p(x) = x. Then noting that the value p(8(x)) depends on x and not on the 
choice x, we set 0(x) = />(#(x)). Then 6 is an analytic automorphism of G. 
Thus the mapping Aut(G) -► K : 9 8 is an isomorphism which makes 

Aut(G) into a Lie group. 

Remarks (2) If G 0 denotes the identity component of a Lie group G, 
it is shown in Loos [1969] that if G/G 0 is finitely generated, then Aut G is a 
Lie transformation group acting on G. 

(3) For the construction of the simply connected covering group (G, p) of 
specific groups G we refer to the work of Chevalley [1946], Freudenthal and 
deVries [1969], and Tits [1965]. 




CHAPTER 9 

SOME ALGEBRA 


Since a more algebraic approach will be taken in the remaining chapters, 
we now introduce some of the necessary algebra. Many of the proofs do not 
actually depend on the use of real or complex numbers but only on the 
characteristic; consequently all the fields we use will be of characteristic 0. 
First we discuss tensor products of vector spaces and linear transformations. 
Using this we consider how to extend the underlying field of the vector 
space to its algebraic closure and apply this to real Lie algebras. Thus we 
discuss the complexification, the realification, and real forms of a Lie algebra. 
Next elementary results on semisimple (i.e. completely reducible) associative 
algebra and Lie algebra modules are derived and finally Cayley algebras are 
considered. 


1. Tensor Products 


Since we shall eventually compute characteristic roots, compare real and 
complex Lie algebras, etc., we now review some general concepts concerning 
tensor products. 

Definition 9.1 Let V and W be finite-dimensional vector spaces over a 
field K of characteristic 0. A tensor product over K of the vector spaces V 
and W is a vector space T over K together with a bilinear map 

r: Vx W^T 
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so that for every bilinear map 

B:VxW-*E, 

where E is any vector space over K, there exists a unique vector space homo¬ 
morphism h:T~* E satisfying h ° z = B; that is, the accompanying diagram 
is commutative. 


V x W — L ~* T 



E 


Thus the bilinear map can be factored by the linear map h and the unique 
“universal” bilinear map z. The following facts are proved by Jacobson 
[1953, Vol. II] and Lang [1965]. 

Theorem 9.2 Let V and W be vector spaces over K. 

(a) A tensor product of V and W over K exists. 

(b) If (T, t) and (T', t') are tensor products over K of V and W, then 
there exists a unique isomorphism / : T-* T' so that / •i=t. Thus tensor 
products are unique up to isomorphism and we speak of “ the ” tensor product. 

(c) If {AT i, ..., X„\ is a basis for V over K and { Y t , ..., Y m } is a basis 
for W over A, then {z(X { , YJ) : i = 1,..., n and j = 1,..., m) is a basis of the 
tensor product T over K. Thus T is finite-dimensional over K and dim T = 
(dim F)(dim W). 

We shall use the notation 

V <8> k W or V ® W 
for the tensor product T and 

X ® Y 

for the elements z(X, Y) in T. Thus elements in V ® W are finite sums 
£ Xi ® Y t for Xj e V and Y { e W. This uses 

a(X ® Y) = aX ® Y = X ® aY 
for XeV, Ye W, and a e K. 

We now consider tensor products of homomorphisms. Let S: V-*V' and 
T: W -* W' be homomorphisms of vector spaces over K and let 

SxT :Vx W-* V x W': (X, Y) -► (S(X), T(Y)). 

For the tensor products (V ® W, z) and (V (x) W', z') we note that 
z' o (S x T): V x W-> V ® W' 





180 


9. SOME ALGEBRA 


is bilinear. Thus by definition, there exists a unique vector space homomorph¬ 
ism P : V ® W ^ V' ® W' so that P o r = r' o (S x T). From this we obtain 

P(X 0 7) = S(X ) 0 7(7) 

and we use the notation P = S 07 or S 0 X 7 called the tensor product of 
the homomorphisms 5 and 7. 

Now let ^ and B be nonassociative algebras over the field K. We shall now 
construct a multiplication on the tensor product 7 = A ® K B as follows. 
Since 7 is generated by elements of the form X 0 7, we define for X = 
X %i 0 7j and X' = ]T X/ 0 7/ in 7 the bilinear function 

H : T x T -J: (X. X') -£ X,*/ 0 7*7/. 

Thus 0 K B becomes an algebra over called the tensor product of the 
algebras A and B. 

Exercises (1) How can an algebra A with bilinear multiplication 
ix : A x A -* A be defined in terms of A 0 A ? 

(2) Let U, V, W be vector spaces over K. Then show the following 
isomorphisms 

V ® K K^V^K 0 K V, V® K W^W® K V, 

U ® K (V ® K W) & (U ® K V) ® K W, 

End x (K) 0 End K (FP) = End K (V 0 W), 

V = L Vi an d W = £ Wj implies V 0 W S £ F* 0 Wj. 

(3) If V and W are nonzero vector spaces over K, show that the tensor 
map r:VxlV->V®l¥is not injective. 

(4) Let S: V V and T : W-> W be vector space homomorphisms. 
Find the kernel of 5 0 T. Show that if S and T are isomorphisms, so is 
S ® T. 

(5) Let A and B be associative algebras over K. Show that A ® K B is 
an associative algebra over K. 


2. Extension of the Base Field 


We continue the notation of the preceding section and discuss how one 
can extend the base field of a vector space to a larger field—in particular, the 
extension to the algebraic closure of the original field; see the work of Jacob¬ 
son [1953, Vol. II; 1962] and Lang [1965], We also consider extensions of 
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homomorphisms and discuss in detail the real and complex case in the next 
section. 

Definition 9.3 Let V be a vector space over the field K and let P be a 
field extension of K which is a finite-dimensional vector space over K. Form 
the tensor product P ® K V and regard it as a vector space over P by defining 

p(ZPi<s> xd = Y,pPi® x t- 

This is well defined and the vector space axioms for P ® x V over P hold. We 
denote this vector space over P by V(P) and call it the vector space obtained 
from V by extending the base field K to the field P. 

Proposition 9.4 Let V be a vector space over K with basis X i9 ..., X m . 
Then the vectors 1 ® X u ..., 1 ® X m form a basis of V(P) over P where P 
is a field extension of K. Thus the dimension of V over K equals the dimension 
of V(P) over P. 

Proof Let X t = 1 ® X t . Then for p eP we have pX t = p( 1 ® 2Q = 
p ® X t . Thus since any X in V(P) has the form Y,Pi ® X i9 X also has the 
form Thus the X t are generators for V(P) over P and they are also 

linearly independent over P (exercise). 

Remarks (1) The vector space V(P) = P ® K V is also a vector space 
over K and in this case dim K V(P) = (dim x P)(dim K V) as noted in Theorem 
9.2. 

(2) For a basis X l9 ..., X m of V over K, the set of A-linear combinations 
of the elements 1 ® X u ...» 1 ® X m form a subset V— {1 ® X : X e V} 
of V(P). Then V is a A-subspace of V(P) and the map V -► V : X -*> 1 ® X 
is a vector space isomorphism. Thus V can be identified as a AT-subspace 
of V{P) and V satisfies the following: 

(i) The vector space spanned by V over P equals V(P). 

(ii) If N is a subset of V consisting of linearly independent vectors over 
K, then N consists of linearly independent vectors over P. 

Next let A be a nonassociative algebra over K with P a field extension of 
K and let A(P) = P ® K A be the vector space obtained by extending the base 
field. Then regarding A(P) as the tensor algebra of P and A as in Section 9.1, 
A(P) becomes an algebra over P by the multiplication 

(I>i ® *,)& Qi ® Yj) = TPiVj ® X, Yj. 
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(3) Let X u ..., X m be a basis of the algebra A over K and let the structure 
constants c k tj e K be given by 

X i X J = Zc* J X k . 

k 

As above the vectors X u ..., X n with X t = 1 0 X t are a basis of A{P) over 
P, and from various definitions 

X t Xj = (10 X t )(l ® Xj) 

= 1 ®X i X J = Zc* j X k ; 

k 

that is, the structure constants relative to corresponding basis in A(P) are 
the same. 

Exercise (1) Let A be an associative or Lie algebra over K and let P 
be a field extension of K. Show that the corresponding extension A{P) is an 
associative or Lie algebra. 

We next consider the extension of the linear transformation T: V -► 
of vector spaces over K. Thus since V(P) = P ® K V and W(P) = P 0 K 
we let 

T= / 0 T : V(P)-+W(P) 

be as in Section 9.1. Then T is called the extension of T and is specifically 
given by 

T£pi®X i )=Zp i ® TX t . 

In particular, if X u ..., X n is a basis of V and T(X t ) = £ a j{ Xj, then for the 
corresponding basis X t9 ..., X n of V(P ) we have 

T(X i )=10TZ I .= X^- 

Thus the matrix of T relative to X u ..., X n is the same as the matrix of T 
relative to X u ..., X n . 

Remark (4) A variation of these results is frequently applied when the 
extension field P is the algebraic closure of K ; in particular when the base 
field R is the real numbers and the extension C is the complex numbers. 
Thus one starts with a real vector space V and an endomorphism T: F 

for which one needs to know information about the characteristic roots. 
Then we pass to the extension T : F(C) -► F(C) to compute this information. 
Frequently the results are already in R or one can prove certain results hold 
over R if and only if they hold over C. 
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More specifically, let V be a vector space over K. Then an endomorphism 
T: F is called split if all the characteristic roots of Tare in K. An assoc¬ 

iative or Lie algebra A over K is called split if all the left multiplications 
L(X) : A -> A : Y -> XY and all the right multiplications R(X) : A -► A : 
Y ->YX are split endomorphisms. 


3. Complexification 


We now apply the preceding results to the case when the base field is the 
real numbers R and the extension field is the complex numbers C. We develop 
some terminology for this case and give an example of a real simple Lie 
algebra for which the complex extension is not simple. 

Let V be a vector space over R. Then noting that for X, Ye V we have 
X + i ® Y e V(C) = C ® R V , we can formally think of 

V(Q = {X + iY:X,YeV and /=(-1) 1 ' 2 } 
with the complex number multiplication 

(a + ib)(X + iY) = (aX — bY) + i(bX + a 7). 

Note that V c: V(C) by identifying V = V + i{Q}. 

Definition 9.5 Let g be a Lie algebra over R. Then the complexification 
of g is the Lie algebra g(C) = C ® R g. 

We also use the notation 

9=9 + i9 

and note that the multiplication in g is given by 

[U+iV X+iY ] = [UX] - [VY] + /([ VX] + [UY]). 

Example (1) We now consider various ways of obtaining complex and 
real Lie algebras from a given Lie algebra. Thus let g be the three-dimensional 
Lie algebra with basis E , F, H over C with multiplication given by 

[HE] = 2E, [HE] = - IF , [EF] = H. 

Then g is a simple Lie algebra; that is, g has no proper ideals (Definition 6.10). 
For suppose A is a subspace of g so that [gh] c h and let 


X — aH + bE + cF e h 
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with a # 0. Then from the multiplicative relations 

[X E] = 2aE-cH eh 

so that 4 aE = [//[XF]] is in h. Thus since a # 0 we see E is in h. From the 
multiplicative relations of g this implies h = g. If a = 0, then similar argu¬ 
ments also show h = g. 

Next let X = iH, Y = iE, and Z = zFbe in g and let g R be the six-dimen¬ 
sional vector space over R with basis H , F, F, X, Y, Z. With the multiplication 
in g R induced from g\ for example, 

[XY] = [iH iE] = i 2 [HE] = -2F, 

we obtain the accompanying multiplication table for g R , where * is computed 



H 

E 

F 

X 

Y 

Z 

H 

0 

IE 

-IF 

0 

2 Y 

—2Z 

E 


0 

H 

-2 Y 

0 

X 

F 



0 

2 Z 

-X 

0 

X 




0 

-2E 

2 F 

Y 


* 



0 

-H 

Z 






0 


using the anticommutivity of the multiplication. Thus g R becomes a six¬ 
dimensional Lie algebra over R which is simple (exercise). 

Now let g R — C ® R g R be the complexification of g R as above. Then g R 
is six-dimensional over C but g is three-dimensional over C. With the basis 
H , F, F, X , Y, Z of over C let k (respectively k) be the complex subspace 
of g R spanned by 

H+iX , F+zY, F+iZ 

(respectively H — iX, E—iY, F — iZ). From the multiplicative relations 
in the above table we obtain [kk] = k , [kk\ = {0}, and [kk\ = Jc so that A; and k 
are ideals of g R . Also note the direct sum 

g R = k © U. 

The C-linear map (p : g k given by 

m = i(H + iX) 9 0(F) = KF + z’Y), 0(F) = i(F 4- /Z) 

is a Lie algebra isomorphism. Similarly k is isomorphic to g. In summary, we 
see that the simple complex Lie algebra g yields a simple real Lie algebra g R . 
However, the complexification g R of this real simple algebra is not a simple 
complex Lie algebra. 





3. COMPLEXIFICATION 


185 


Example (2) There exist real Lie algebras which are not isomorphic 
but have the same complexification. Let n > 1 and let 0 < /c < [«/2], where 
[n/2] means the largest integer less than or equal to n/2. Let 



where I } is they x j identity matrix. Then a k defines a symmetric nondegener¬ 
ate bilinear form B k on the real vector space V = R n [note example (7) 
Section 2.3]. Let 

SO{k, n — k) = {T e SL(R "): B k (TX, TY) = B k (X, Y) all X, Y e R”j 
which has Lie algebra 

so(k, n — k) = {S e sl(R"): B k (SX, Y) + B k (X, SY) = 0 all X, Y e R”}. 

Then so(k, n~k ) and so{\, n — are not isomorphic if k ^ 1 (exercise). 

However, the complexification of so(k, n — k) for any k is just 

so(n, C) = {S e gl(C n ): B(SX, Y) + B(X, SY) = 0 all X, Ye C n } 
where B is the complex symmetric bilinear form with matrix a = I „. 

Exercise (1) A conjugation in a complex Lie algebra g is a function 
C : g -+ g so that for X, Y eg, a eC, and a the conjugate of a in C, we have 

C[XY] = [CX CY], C 2 = I, 

C(X +Y)=CX + CY, C(aX) = dC(X). 

(i) Let g be a real Lie algebra and let g = g + ig be its complexification. 
Show that C: g -*g : X + iY -* X — iTis a conjugation and g is the fixed 
point set of C. 

(ii) For the Lie algebras k and k in the above example (1), show that 
there exists a conjugation of g such that C : k^k. 

(iii) Let h be a real Lie subalgebra of the Lie algebra g and let g be the 
complexification of g as in (i). Show that the complexification h = C (x) R h 
is a subalgebra of g. Conversely, show that if h is a subalgebra of g so that 
C(fi) <= h where C is given in (i), then h is the complexification of some real 
subalgebra of g. 

Definition 9.6 Let g be a Lie algebra over C of complex dimension n. 

(a) By restricting the scalars to R, the Lie algebra g can be considered as 
a Lie algebra of dimension 2 n over R denoted by g R and is called the realiza¬ 
tion or realification of g. 
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(b) Let g R be the realization of the complex Lie algebra g , let h be a real 
subalgebra of g R , and let h = h + ih be the complexification of h. Then h is 
called a real form of g provided there exists a complex Lie algebra isomorphism 
(j): g -» h so that for all X eh. 

Remark (1) Frequently in the definition of real form, it is required 
that g = h. 

Example (3) Let g be the three-dimensional complex Lie algebra of 
example (1), this section. Then the six-dimensional algebra^ is the realization 
of g. The real subalgebra h of g R generated by { H , E , F } is a real form of g. 

Exercise (2) Is the realization k R of k in example (1) a real form 

of 9 r ? 

Proposition 9.7 Let g be a complex Lie algebra and let h be a subset of g. 
Then h is a real form of g if and only if h is the set of fixed points of a con¬ 
jugation of g . 

Proof Let C be a conjugation of g which has h as its set of fixed points. 
Then clearly h is a subalgebra of the realification g R . Next let h be the com¬ 
plexification of h and define a map 

, r v X + ax , .{X - C(X)\ 

j— + •(—%—) 

which is an isomorphism so that for all X eh 

<j)(X) = C(X) = X. 


Thus h is a real form of g. 

Conversely if h is a real form of g , we can identify g with h by the iso¬ 
morphism in the definition of real form; that is, let g = h + ih as a direct 
sum. Then the map 


C : g ->g : X + iY X — iY 
is a conjugation with h as its set of fixed points. 

Exercises (3) Show that the map </> defined in the above proof actually 
maps g into the complexification of h. Thus what can be said about the 
expressions (X + C(X))/2 and (X - C(X))/2i for all X eg! 
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(4) Let h be a real form of# = h 4- ih, and let C : g -► g : X + iY -► X - zT 
be a conjugation. Let <? be another conjugation of g with h as its fixed point 
set. Show that C = C. What can be said if we regard g = h + ihl 

(5) Show that there exists a complex Lie algebra which has no real form, 
possibly as follows: 

(i) Let g = CX + CY + CZ where the basis, X, Y , Z of this three- 
dimensional algebra has multiplication [XY] = oeY, [XZ] = 0Z, [YZ] = 0 
for some cc, e C. Then g' = [gg] = CY + CZ. 

(ii) Assume h is a real form of #. Then h is the set of fixed points of a 
conjugation C of g. Show that h! = g' n h is a real form for #'. 

(iii) Let U = aX + 6Y +cZ be in h but not in A' so that a ^ 0. Then 
ad* U leaves A' invariant and so induces a real endomorphism of h!. Show 
that (ad U)Y = ctaY and (ad U)Z = fiaZ so that oca and fia are real character¬ 
istic roots. Thus a//? = a a/fla e R. However a and can be chosen in C so this 
cannot happen which contradicts the assumption of a real form. 

Definition 9.8 Let g be a complex Lie algebra. The conjugate Lie 
algebra #* of g is given as follows. The algebra #* is the same Abelian group 
as g and has the same algebra multiplication as g . However, the scalar multi¬ 
plication * in #* is given by a * X = aX , where a is the complex conjugate of 
a e C and the scalar multiplication aX is that in g . 

With these definitions #* is clearly a Lie algebra over C. 

Theorem 9.9 Let g be a complex Lie algebra and let g R be its realization 
and g R be the complexification of g R . Then 

g R ^g 0#* 

where the (external) direct sum of g and its conjugate algebra #* contains these 
algebras as ideals. 

Proof If A + B denotes the elements in g © #*, then we define the map 

T : g R ^g © #* : X+ iY^ (X + iY) + (X + i * Y) 

where the X , Y eg R are uniquely determined by X + iY e # R = C ® g R . 
Thus the elements X + iY e g and X + i * Y e g* are well defined as elements 
of these algebras over C. Therefore T is well defined and is a Lie algebra 
isomorphism. For example, let X + zT, U + iV e g R . Then 


[X + iY U + iV[ = [XL] - [YK] + i([YU] + [XL]) 
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in g R gives 

T([X + /Y U+ iV]) = ([XU] - [YV] + i([YU] + [IK])) 

+ ([*C] - [YV] + / * ([YU] + [IF])) 

= ([*£/] - [YF] 4- [lYt/] + [XiV)]) 

+ ([XC] - [YF] + [/ * YU] + [Xi * V]) 

= [X + iY U + iV] + [X + i* Y U + f* V]. 
However from the definition of multiplication in a direct sum we also have 
[T(X + iY)T(U + iV)] = [(X + iY) + (X + i*Y)(U+ iV) + (U + i * F)] 

= [X + /Yt/+/F] + [A r + i* Y(/+i* K] 

which shows T preserves products. The proof that T is a vector space iso¬ 
morphism is left to the reader. 

Exercise (6) Compare example (1) of this section with Theorem 9.9. 


4 . Modules and Representations 


We briefly review the basics of modules and representations for associa¬ 
tive algebras and make the corresponding definitions for Lie algebras. 
However, throughout most of the text we shall view these concepts in the 
framework of algebras of endomorphisms acting on a vector space. 

Definition 9.10 Let A be an associative algebra over a field K and let V 
be a vector space over K . Then V is a (left) T-module provided there exists a 
bilinear mapping A x V : (5, X) SX satisfying ( ST)X = S(TX) for 
all S, T e A, and X e V, and a(SX) = (aS)X = S(aX) for all a e K. 

Remarks (1) Given an ^-module K, for each S e A we can define the 
endomorphism p(S ) : VV by p(S)X = SX. Thus since p(S)p(T)~ p(ST) 
we see that the action of A on V is given by the action of the algebra of 
endomorphisms p(A) = {p(S ): S e A} on V . Conversely if A is an associative 
algebra acting on V according to the above formulas, then V is an ,4-module. 

(2) According to Definition 7.8, the above mapping p : A -► p(A) : S -► 
p(S) is a representation of A in V. Conversely given a representation p of 
A in F, then we can make F into an ^-module by defining the action 
Ax V -► F: (5, X) -► p(S)X. Thus the concepts of an associative algebra of 
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endomorphisms acting on a vector space, ^-modules, and representations 
are the same. 

(3) We assume the reader is familiar with submodules, quotient modules, 
module homomorphisms, etc., but we discuss completely reducible (i.e., 
semisimple) modules in the next section. 

Definition 9.11 Let g be a Lie algebra over a field K and let V be a vector 
space over K. Then V is a ^-module provided there exists a bilinear mapping 
g x V -> V: (5, X) -> SX satisfying 

[ST]X = S(TX) - T(SX) 

for all S, Teg and X e V, and a(SX) = (aS)X = S(aX) for all a e K. 

Remarks (4) As for the associative case, one has the definitions of Lie 
submodules, quotient modules, homomorphisms, etc. Thus if V and W are 
^-modules, then <p : V -+ W is a ^-homomorphism if cj> is a homomorphism 
of the vector spaces and commutes with the action of g; that is, 4>(SX) = 
S(4>X) for all S e g and X e V. 

(5) If V is a ^-module, then for each S e g we can define the endomor¬ 
phism p(S) : V -> V by p(S)X — SX. From the definition we see 

p([ST]) = p(S)p(T) - p(T)p(S) = [p(S\ p(T)] 

so that the mapping p : g -> p(g): S -> p(S) is a representation ofginV accord¬ 
ing to Definition 7.8. Thus a ^-module yields a representation and also 
conversely; for if p is a representation of g in V, then V becomes a ^-module 
by defining g x V - V : (S, X) -+ p(S)X. 

(6) Let P be a set of linear transformations acting on the finite-dimen¬ 
sional vector space V over K\ that is, we have a mapping P x VV : (S, X)-+ 
SX so that S(aX + bY) = aS(X) + bS(Y ) and we can regard P <= End(F). 
Then we can form the associative algebra of endomorphisms s/(P) or the 
Lie algebra of endomorphisms £P(P) generated by P. Thus s#(P) consists of 
all finite sums of products of elements of P, and J£(P) consists of all finite sums 
of commutators of elements of P; note Proposition 7.6. In either case we can 
regard V as an j/(P)-module or a J^(P)-module, and the P-invariant sub¬ 
spaces are just submodules. 

Examples (1) Let V be a vector space over R , let G be a Lie group, 
and let p : G -* GL{V) be a (differentiable) representation of G in V. Thus 
regarding V as a manifold, G operates differentiably on V by the action 

G x V^> V: (S, X) -* p(S)X 

(recall Definition 3.17). From Lemma 7.15 we see that if g is the Lie algebra of 
G , then Tp(e): g -> gl(V) is a representation of g in V. Thus Fis a ^-module. 
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A more direct computational way of viewing this is to regard G <= GL(V) 
with Lie algebra g c: gl(V). Then for S , Teg we have for t near 0 in R that 
for X e V, 

(e tS e tT )X = 

Using this formula and the Campbell-Hausdorff formula for computing 
exp tS exp tT = exp {tS 4- tT + \t 2 [S 9 T] 4- • * *) one obtains 

[S, T]X = S(TX) - T(SX ) 

as expected. 

(2) We can construct more modules using tensor products as follows. 
First let A be an associative algebra over the field K and let V and W be 
^-modules. Then V ® K W becomes an ^-module when the action of A is 
given by P(£ X % ® Y t ) =£ PX t ® PY t for P e A and X 4 eV,Y x e W. Simi¬ 
larly if g is a Lie algebra over K and V and W are ^-modules, then V ® K W 
becomes a ^-module when the action of g is given by 

P(ZXt ® Y t ) = I PX t ®Y i + X i ® Py t 
for Peg and X t e V, Y t e W (that is, “differentiate” the associative action). 


5. Semisimple Modules 


We continue the notation of the preceding section and discuss irreducible 
modules and their direct sum. We also review the basics of finite-dimensional 
semisimple associative algebras and their modules; for elementary ref¬ 
erences see the work of Jacobson [1953, Yol. II; 1962], Lang [1965], and 
Paley and Weichsel [1966]. 

Definition 9.12 Let V be a finite-dimensional nonzero vector space over 
K . Let A be an associative subalgebra of End(K) or let A be a Lie subalgebra 
of gl(V), thus V is either an associative or Lie module. 

(a) The vector space V is a simple or irreducible ^-module if the only 
^-submodules of V are V and {0}. In this case A is called an irreducible algebra 
of endomorphisms on V and say A acts irreducibly on V. 

(b) The vector space V is a semisimple or completely reducible ^-module 
if V is a vector space direct sum of irreducible ^-modules. In this case we 
frequently say that A acts in a completely reducible manner on V or A is a 
completely reducible algebra of endomorphisms on K. 
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We summarize some standard results [Jacobson 1953, Vol. II; Lang, 
1965]. 

Proposition 9.13 Let V be a vector space over K and let A be an associa¬ 
tive or Lie algebra of endomorphisms of V so that V is an .4-module. 

(a) Then V is a completely reducible ^-module if and only if for every 
^4-submodule W of V there exists an ^4-submodule W' so that V = W 4- W* 
which is a submodule direct sum. 

(b) If V is completely reducible with decomposition V — V x + • * * + V t 
into irreducible subspaces V t , then the V { are uniquely determined up to an 
^-isomorphism and the length t is unique. In this case, we can choose a basis 
of V consisting of bases of the components V t so that each S e A has block 
matrix 


0“ 

S 2 

0 s t _ . 

The matrix S t represents the action of 5 on V t . 

Now we restrict A to be associative. 

Proposition 9.14 Let V be a vector space over K and let A be an associa¬ 
tive algebra of endomorphisms of V so that V is an ,4-module. Let A = 
{T e End(K): TS = AT for all S e A} be the centralizer of A in End(K). 

(a) (Schur’s lemma) If V is an irreducible yl-module, then A is a 
division ring; that is, A is an associative algebra over K for which every non¬ 
zero element has an inverse. In this case, if the field K is algebraically closed, 
then A = KI where / is the identity endomorphism. 

(b) (Burnside’s theorem) If A is an irreducible algebra of endomorph¬ 
isms on V and if the field K is algebraically closed, then A = End x (K). 

Examples (1) Let A be a nonassociative algebra over K with bilinear 
multiplication function a. Then we have previously considered an ideal 
B of A as a subspace so that a (B, A) <= B and oc(A, B) B. Now, as in Section 
7.2, we let R(X) and L(X ) be the right and left multiplication functions on A 
and let P be the subspace of End(^4) spanned by all R(X) and L(Y) for 
X , Y e A. Then an ideal is just a subspace B which is invariant under the 
associative algebra s/(P) or the Lie algebra SP(P\ In particular an algebra A 
is simple if and only if A 2 =£ {0} and A has no proper ideals; that is, A 2 ^ {0} 
and A is ^/(P)- or Jz?(P)-irreducible. 
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In particular, if A is an associative algebra over K which is simple, then 
A is isomorphic to a suitable n x n matrix ring E over some division ring 
D *=> K; that is, the matrix ring of all n x n matrices {a i} ) where a {j e D. See 
the elementary proofs of Lang [1965] and Paley and Weichsel [1966]. 

(2) There are many starting points for the definition of a semisimple 
nonassociative algebra—depending on the class of algebras being studied. 
However, regardless of the starting definition, the usual conclusion is that A 
is a semisimple algebra if A 2 ^ {0} and A = A x © • • • © A t a direct sum of 
ideals which are simple algebras. Thus A is a completely reducible or 

JSP(P)-module. 

In particular, let A be a finite-dimensional associative algebra over K. 
An ideal N of A is nilpotent if there exists an integer k so that {0} = N k 
( = NN ... A, Axtimes). One can show that the sum of two nilpotent ideals is a 
nilpotent ideal and consequently define the radical of A , rad(^), to be the 
maximal nilpotent ideal. We shall show in Chapter 12 that if A is an 
associative algebra so that A 2 ^ {0}, then A = A x © • • * © A t is a direct sum 
of ideals which are simple algebras if and only if rad (A) = {0}; that is, A is a 
semisimple ^-module if and only if A has no nonzero nilpotent ideals. 

Note that if A is semisimple, then A = A t © • • • © A t where each A { 
is isomorphic to some ring of all x n { matrices over some division ring. 
Thus, in particular, each A x has an identity e t so A has an identity 1 = 
e t + 

These remarks can be used to prove the following results. 

Proposition 9.15 Let V be a finite-dimensional vector space over K and 
let A be an associative subaigebra of End(F) such that V is a completely 
reducible ^-module. Then A is semisimple. 

Proof Let N = rad(^4) be the maximal nilpotent ideal in A and let 
V = V t © • • ■ © V t where the F f are nonzero ^-irreducible submodules. Let 

NV t = {X T k X k : T k e N and X k e V,). 

Then since N is an ideal in A we see that NV t is an ^-submodule of V which is 
contained in V { . Since V { is irreducible, NV { equals V ( or {0}. If NV t = V it 
then 

N 2 Vi - N(NVi) = NVi = Vi 

and by induction K,- = However, since N is nilpotent, this equation 

implies V t — {0}, a contradiction. Thus NV { — {0} for all i, so that N = {0}; 
that is, A is semisimple. 
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Remarks (1) Before proving the converse statement we briefly review 
results on a simple associative algebra A over K in terms of an n x n matrix 
algebra E over a division algebra D =d K; see the more formal proofs of 
Lang [1965] and Paley and Weichsel [1966]. First the identity matrix I — 
E x + • • • + E„ where E 2 = E t are idempotents which are matrices of the 

01 


oj 

form with 1 in the (/, ^-position. Thus EE t is an irreducible ^-submodule con¬ 
sisting of matrices of the form 


E; = 


tfli 

0 : 0 

. a ni . 

where a pi e D . Consequently E = EE X + • * * + EE n is the direct sum of these 
irreducible i>modules. Going back to A we see that the identity 1 e A can be 
decomposed 1 = e x -f • * • + e n so that the = Ae t are irreducible ,4-sub- 
modules (i.e., left ideals) and A = + ■ • • + B n is a direct sum. 

(2) Let B t = A e t be an irreducible left ideal of A as above and let Fbe an 
,4-module. If X e V is such that e t X ^ 0, then B t X # {0} and B t X is an 
,4-module. The map 


: Bi^BiX :b^bX 

is an ,4-module homomorphism and ker(</>) is an ,4-submodule of B t . Since B { 
is an irreducible ,4-module and B t X # {0} we have <j> is an isomorphism; 
that is, B t X is an irreducible ,4-submodule of V. 

(3) Let A be an associative algebra with identity 1 and let V be an 
,4-module. Then V is called a unital ,4-module if \X = X for all X e V. 

Let A = B x © • * * © B„ as in remark (1), let V be a unital ,4-module, and 
let 0 ^ X g V. Then X = IX = e x X + * * • + e n X so there exists e t with 
e { X / 0. Thus B t X is an irreducible ,4-module and X e B x X + • • * + B n X y 
a sum of irreducible ,4-modules. This implies V is a completely reducible 
,4-module. Thus we leave as an exercise: Let {Wjij e a} be a family of irre¬ 
ducible ,4-submodules of V so that every X e V can be expressed as a finite 
sum of elements from the W/ s. Then V is a direct sum of irreducible A- 
submodules. 



194 


9 . SOME ALGEBRA 


For a semisimple algebra A we combine the above remarks to obtain the 
following. 

Proposition 9.16 Let A be a finite-dimensional semisimple associative 
algebra over K and let V be a finite-dimensional ,4-module. Then V is a 
completely reducible ,4-module. 

Proof Since 1 e A let V 0 = {Z e V : 1Z = 0}. Then V 0 is an ,4-submodule 
of V such that AV 0 = {0}. Next note any X e V can be written 

X = 1X + (X- IX) 

where l(\X) = IX and 1(X - IX) = 0; that is, X = Y + Z where Z e V 0 
and Y e V x = {U e V : 1 U = £/}. This gives the direct sum V = V ± + V 0 
where V x is a unital ,4-module and therefore completely reducible. Now 
choosing a basis for V 0 we can write V 0 = KX t + • • • + KX r as a direct sum 
of one-dimensional ,4-modules which are irreducible. Combining these de¬ 
compositions, V is completely reducible. 

Remark (4) Concerning complexification, we shall use bilinear forms 
in Chapter 12 to easily show a (finite-dimensional) associative algebra A 
over R is semisimple if and only if the associative algebra C ® R A over C 
is semisimple. This with the preceding results shows that if 1 e A and V is an 
^[-module, then V is a completely reducible ,4-module if and only if C ® R V 
is a completely reducible C ® R ,4-module. 


6. Composition Algebras 

We will now construct some nonassociative algebras which will be very 
useful in describing certain simple Lie algebras in Chapters 13 and 14. This 
material also yields some interesting applications of the previous material in 
this chapter. 

Definition 9.17 A composition algebra ^ over a field K of characteristic 
0, is a nonassociative algebra # over K with an identity element denoted by 1 
and a map N -> K called the norm of # such that the following three 
properties hold: 

(a) N(aX) = a 2 N(X) for any a e K and any X eW; 

(b) N(XY) = N(X)N(Y) for any I, 7 e^; 

(c) B(X , Y) = [N(X + Y) - N(X) - N(Y)\/2 for all X, Y e <£ defines a 
nondegenerate symmetric bilinear form on 
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Exercises (1) Show that N(l) = 1 and B(X, X) = N(X) for all X e 
where X, is any composition algebra. 

(2) Suppose Xe K 'is not a square in K. Let XX = K(X l/2 ) be the quadratic 
field extension of K and define N(a + bX 1/2 ) = a 2 — b 2 X. Show that '€ is a 
composition algebra. 

(3) Show that the associative algebra of 2 x 2 matrices over K is a 
composition algebra if we define N(X) = det(X). 

Definition 9.18 For any composition algebra XI and any X e Xi define 
X = 2B(X, 1)1 — X. Then X is called the conjugate of X and the overbar is 
called the involution of c €. Notice that B(X, 1)1 = (X + X)/2 and that if we 
write X — a\ + Y with B(Y, 1) = 0, then X = al — Y. 

Proposition 9.19 For any X, Y, Z, W in a composition algebra X we 
have 

(a) B(XY, XZ) = N(X)B(Y, Z) = B(YX, ZX) ; 

(b) B(XY, WZ) + B(XZ, WY) = 2B{X, W)B{Y, Z); 

(c) X 2 - 2B(X, \)X + N(X)l = 0; 

(d) B(XY, Z) = B(Y, XZ) = B(X, ZY ). 

Proof (a) From Definition 9.17 we see 
2 B(XY, XZ) = N(XY + XZ) - N(XY) - N(XZ) 

= N(X(Y + Z)) - N(XY) - N(XZ) 

= N(X)(N(Y +Z) - N(Y) - N{Z)) = N{X)2B(Y, Z). 

This kind of proof is routine and is referred to as “ linearizing ” equation 
9.17(b) with respect to Y. 

(b) This is proved by linearizing (a) with respect to X. 

(c) We note that 

B(X 2 - 2B(X, 1)X + N(X)l, Z) = B(X 2 , Z) - 2B(X, l)B(X, Z) 

+ N(X)B(\,Z) 

= B{X 2 ,Z-\) + B{X-\,Z-X) 

- 2 B(X, Z)B(X, 1) = 0. 

Both (a) and (b) above were used in the computation. Now the formula in (c) 
follows from the nondegeneracy of B(X, Y). 

(d) From (b) we have 

B(XY, Z) + B(XZ, Y) = 2 B(X, 1 )B(Y, Z), 
and thus 

B(XY, Z) = B(Y, 2B(X, 1 )Z - XZ) = B(Y, XZ), 
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and finally the other parts of (a) and (d) which were not proved follow from 
obvious symmetries. 

Exercise (4) Prove that the following formulas hold for any X and Y 
in a composition algebra: 

(i) XX=XX = X(X) 1; 

(ii) Yy=yx 

(iii) X = X 

Properties (ii) and (iii) are often used to define an involution. 

Definition 9.20 Given a composition algebra over K and any 0 ^ a e X, 
define a new algebra ^<a> with underlying vector space ^ x ^ and a multi¬ 
plication and quadratic form N' given by the following Cayley-Dickson 
formulas for X , Y, Z, W e 

c X , Y)(Z, W) = (XZ + aWY, WX + YZ), 

N'((X, Y)) = N(X) - aN(Y). 

We will shortly state a theorem which shows that deciding when the algebra 
^<a> is a composition algebra is a way of classifying all composition algebras. 

Remarks (1) Rather than using pairs (X, Y), one can describe #<#> 
by using a symbol, say u. Then ^<( a > = ^ + *€u = {X 4- Yu : X, Y e^} with 
N\X + Yu) = N(X) - aN(Y) and X(Wu) = (WX)u 9 (Yw)Z = (YZ)w and 
(Y«)(JYw) = a WY. 

(2) Since K is a composition algebra if we define N(a) = a 2 , we can 
consider X<oc> for any 0 ^ oe e X and if this is a composition algebra, consider 
X<oe, /?> = X<a></?> and continue until one obtains an algebra which is not a 
composition algebra. The X<a)’s are called quadratic algebras, the X<a, /?>’s 
are called (generalized) quaternion algebras and the X<(a, /?, y)’s are called 
(generalized) Cayley algebras. We are justified in giving names to these 
algebras by Hurwitz’s theorem, the proof of which can be found in the work of 
Schafer [1966, Chap. 3]. 

Theorem 9.21 (Hurwitz) All quadratic, quaternion, and Cayley algebras 
are composition algebras. If ^ is a composition algebra over K of dimension 
greater than 1, then ^ must be isomorphic to some quadratic, quaternion, 
or Cayley algebra over X. Quadratic algebras are commutative and associa¬ 
tive, quaternion algebras are associative but not commutative, and Cayley 
algebras are neither commutative nor associative. 
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Remark (3) Proposition 9.19(c) can be used to show that any algebras 
isomorphism of two composition algebras also leaves the norms and bilinear 
forms invariant. Thus the isomorphisms referred to in Theorem 9.21 can be 
thought of as isomorphisms of only the algebra structures or as norm-invariant 
isomorphisms. 

Example (1) We often refer to 5<-l, -1> as “the” quaternion 
numbers. It can be described as the real associative algebra with basis 1 ,u,v, 
uv , where u 2 = v 2 = -1, vu = -uv , and N(a + bu + cv + duv) = a 2 + b 2 
+ c 2 + d 2 . Then 5< — 1, — 1) is a division algebra. 

Definition 9.22 A composition algebra # over K is said to be a split 
composition algebra if it possesses zero divisors or equivalently if it is not a 
division algebra. 

Proposition 9.23 The following statements about a composition algebra 
# over K are equivalent: 

(a) ^ is a split composition algebra; 

(b) there exists a nonzero X with N(X) = 0; 

(c) there exists anle^ with B(X , 1) = 0 and N(X) = -1. 

Proof (a) implies (b) The algebra <€ split implies there exist nonzero 
X,YeV with XY = 0 and 0 = N(XY) = N(X)N(Y) so N(X) = 0 or N(Y) = 0. 

(b) implies (c) Suppose N(X) = 0 for 0 ^ X e <€. If B( 1, X) # 0, set 
Y = 1 —5(1, X)~ x X and compute that 5(Y, 1)^0 and N(Y) = —1. If 
5(1, X) = 0, then from the nondegeneracy of the bilinear form on ^ we 
must have that ^ is four- or eight-dimensional and there exists some Ze^ 
with 5(1, Z) = 0 and B{X , Z) = 2. Let Y = Z — ( N(Z) -h 1)AT and compute 
B(Y, 1) = 0 and A(Y) = -1. 

(c) implies (a) If with B(X , 1) = 0 and A(2Q = -1, then, by 

Proposition 9.19(c), X 2 = 1; so (X + 1)(X - 1) = 0 and so X + 1 is a zero 
divisor and ^ is a split composition algebra. 

Proposition 9.24 Over any field K there exist split quadratic* quaternion, 
and Cayley algebras. Any two split composition algebras over K of the same 
dimension are isomorphic. 

Proof We see A^<1>, AX 1, 1>, and AX1, 1, 1> are split algebras from 
Proposition 9.23. Thus it suffices to show that any split composition algebra 
over K must be isomorphic to one of these three algebras. 
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Assume <€ is split. Proposition 9.23(c) shows that there exists an U e <<? 
with 5(1, U) = 0 and N(U ) = — 1. Since U 2 = 1, si = K\ + KU is a sub¬ 
algebra of # and it is trivial to check that a\ + bU —► (a, b) is an isomorphism 
from si onto K( 1). 

If has dimension greater than 2, we must continue. Choose any 
0 # Y s V with B(X, Y) = 0 for all X e si. If X e si with N(X) = 0, then 
N(XY) = 0 and by Proposition 9.19(d) B(Z, XY) = 0 for all Zb si. Now 
repeating an argument used in the second part of the proof of 9.23 there exist a 
V e # with B(Z, V) = 0 for all Z e si and N(V) = — 1. It is claimed that 
si + si Vis isomorphic to s/(l) s A^<1, 1>. Using Proposition 9.19(d) it is 
easy to show that N(X + ZV) = N(X) — JV(Z) as required so we need only 
check the nontrivial multiplication formulas. 

Assume X, Y. Z e si in this paragraph. Since B(X, YV) = B(XY, V) = 0 
and B(X V, YV) = —B(X, 7) it is clear that the bilinear form restricted to 
siV is nondegenerate and si Vis orthogonal to si. For any W e ( €, X, Y e si 
and V as above 

B(YV, WX) + B(YX, WV) = 2 B{Y, W)B{V , X) = 0 

so B({YV)X,W) = B({YX)V,W) for all WeV and (YX)V = (YV)X. 
Applying the involution to both sides of this equation yields X(YV) = ( YX)V 
which is one of the three multiplication formulas required. 

The second formula (XV)Y = (XY)V follows easily from the first and 
involution formulas. 

Finally, for any W e <€ 

B(W(YV), VX) + B(WX, V(YV)) = 2B(W, V)B(YV, X) =0 

so (X_V)(YV) = (V(Y_V))X. However B(W, V(YV)) = —B{VW, VY) = 
B{W, Y )so V(YV)= Y and (XV)(YV) = YX. 

This completes the proof that 3$ = si + si V is a subalgebra of iso¬ 
morphic si(l} which is isomorphic to AT<1, 1>. If is eight-dimensional, 
we now repeat the above argument word for word to complete the proof of 
the proposition. 

Exercises (5) Show that the algebra of 2 x 2 matrices over is: is a split 
quaternion algebra. Also show that AT<a> ss K(a l/2 ) if 0 ^ a e K is not a 
square in K and K(a') is split otherwise. 

(6) Show that if ^ is a composition algebra over K and L is a field 
extension of K, then = L ® K is a composition algebra over L where 
N(a ® X) — a 2 N(X) for a bL and X b^. Also show that if ^ is split, then 
so is ( 3(L). 

(7) Suppose that ^ is a division (nonsplit) composition algebra of 
dimension greater than 1, over K and that N(X) = a # 0 for some X e # 
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with B(X, I) = 0. Show that #(£((-a) 1/2 )) = K((-x) 112 ) ® K V is a split com¬ 
position algebra over K(( — ol) 1/2 ). Use this result to show that if '6\ and 
are any two composition algebras of the same dimension over K , there exists 
a field extension L of degree at most 4 over K so that t (L ) = 

Example (2) The following matrix-type description of a split Cayley 
algebra is sometimes useful. Let K be any field of characteristic 0 and let 'V' 
denote the set of three-dimensional column vectors over K. For any u.veY' 
let u x v and (u, F) denote the usual cross product and inner product, respec¬ 
tively. Let 

*“{[* U ^:a,beK,u,ver j 

and define 


a u 

1 _ \ ac — ^ 

aw + du + v x z 

v b 

J [z d J [cv + bz + u x w 

bd — (v, w) 


and 

*([« *])=“* + (“-”> 

for all a, b, c, d e K and u , v , w, z e i r . 

To show that ^ is a composition algebra we must verify the three formulas 
in Definition 9.17 for composition algebras. Formulas (a) and (c) are trivial 
to verify and (b) is equivalent to 

(ab + (u, v))(cd + (w, z)) = (ac - (i u , z))(bd — (v, wj) 

+ (aw + du + v x z, cv 4- bz + u x w) 

It is easy to check that this formula follows from two well-known vector 
equations, namely (u x v, u) = (u x v, v) = 0 for all w, v e if and 
(w x w, v x z) = (w, u)(w, z) — (w, z)(v , w) for all w, r, w, z e f". Finally it is 
very easy to choose nonzero elements of ^ of norm 0 so ^ is a split eight¬ 
dimensional composition algebra; that is, a split Cayley algebra. 

Exercise (8) Let be any composition algebra over K , let I be a 
field extension of K of finite degree, and let ©(«) denote the derivation algebra 
of <g. Show that g>(<g(L)) = 

Remark (4) One of the important properties of quaternion and Cayley 
algebras is that they are simple nonassociative algebras. This fact can be used 
to show that all derivations of these two types of algebras are inner; that is, for 
these algebras 3(<g) is contained in U<g) which is the subalgebra of gl(<£) 
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generated by the linear transformations L(X): Y -» XY and R(X): Y -* YX. 
The proof of these results can be found in the work of Schafer [1966, Chap. 3] 
or Section 12.5, exercise (3). 

Exercise (9) Let 2 be a quaternion algebra over K and for X 6 2 
define D{X) = L(X) — R(X). Assuming the results mentioned in the above re¬ 
mark show that 3(2) = {D(X): X e 2, B(X, 1) = 0} and that [D(X), D(Y)] = 
D{XY - YX). Also show that for any 2 there exists a field extension L of 
K of at most degree 2 over K such that (2>(2))(L) ^ sl(2, L). 

Example (3) It is much more difficult to describe the derivation 
algebra of a Cayley algebra but it is not quite so difficult for the split Cayley 
algebra as described in the previous example. For any A e sl{ 3, K) and any 
x, y e X define D(A, x, y) :<£->■ by 

/vi v vV \ a M 1 _> r<* ^ + Au + (a — b)x -Jixtil 

D{A ’ X ’ y >'[v b\^[-A‘v + (.a-b)y + xxu -(x,v)-(y,u) J* 

A long computation involving a few vector formulas verifies that D(A, x, y) e 
and a similarly easy but unpleasantly long computation shows that 
W{A 3 , x u jj), D(A 2 , x 2 , y 2 )] = D(A 3 , x 3 , y 3 ), where 

A s = Ml, A 2 ] + 3xiJ 2 ' - 3X2^' + ((x 2 , *) - (*!, y 2 ))l, 
x 3 = A x x 2 - A 2 x t - 2y 2 x y 2 , 

?3 = -^ 1^2 + A 2 *Yi + 2Xi x x 2 . 

Some of these computations can be found in Schafer’s book [1966] which also 
has a proof that = {D(A, x, y)\ A e sl( 3, A'), x, y e X). 

Using exercise (8) we can now conclude that for any composition algebra 
c 6, MX) must be fourteen dimensional because we can extend the base field 
to say L so that r €{L) is split and then notice that we can compute the dimen¬ 
sion of 3t(^(L)) from the description above. Also notice that for any X e 
and D e MX) we have B(D(X), 1) = 0 since this is easy to verify in the split 
case. Since D{\) = 0 we can conclude that M'€) also can be thought of as the 
set of derivations of <€ acting on the elements of trace 0 that is acting on 

= {X e '4 :B(X, 1) = 0}. This may be restated as: <g 0 is an invariant 
submodule of or the action of 2j{Y>) on ^ gives a seven-dimensional 
representation of We will see in Chapters 13 and 14 that M'£) is 

simple. 

Exercise (10) If <«? is any Cayley algebra, show that MX) acts irreduc- 
ibly on X 0 . Notice that you may assume the X is split. 





CHAPTER 10 

SOLVABLE LIE GROUPS 
AND ALGEBRAS 


We now start the structural development of Lie groups and algebras. 
First we define a Lie group to be solvable if it is solvable as an abstract 
group. Then using the “derivative” of these results we discuss solvable Lie 
algebras. Thus we show that a connected Lie group is solvable if and only if 
its Lie algebra is solvable. Finally we discuss Lie’s theorem which involves 
finding a common characteristic vector for a solvable Lie algebra of endomor- 
phisms acting on a complex vector space. This eventually yields that the 
matrices representing a solvable Lie algebra of endomorphisms acting on a 
complex vector space can be put into triangular form by using a suitable basis 
of the vector space. Once again, all fields in this chapter will be assumed to 
be of characteristic zero. 


1. Solvable Lie Groups 


Let G be an abstract group and let A and B be subgroups of G. Then we 
have the following notation. 

(1) We denote by ( A , B) the subgroup of G generated by all elements 
xyx~ l y~ l for x e A, y e B. 

(2) If A is a normal subgroup of G we write A*a G or G>L 

Note that if A and B are normal subgroups of G, then (A. B) is a normal 
subgroup of G. 
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Definition 10.1 Let G (1) = (G, G) and define by induction G (k+1) = 
(G^ k \ G (fc) ). Then we have the sequence of normal subgroups 

Thus G is solvable if this sequence is finite and terminates at { e }; that is, there 
exists n so that G (n) = {e} and G is called solvable of length n. 

From results of Lang [1965] we have the following theorem: 

Theorem 10.2 Let G be an abstract group. Then the following are 
equivalent. 

(a) The group G is solvable. 

(b) There is a finite sequence of subgroups 6 = G 0 >G 1 >G 2 ,,, >G n = 
{e} such that G k /G k+i is commutative for A; = 0, 11. 

Proof First we observe that by induction each G (k) is a normal subgroup 
of G. Now assume (a). Then note from the definition of ( G , G) that G/G (1) is 
commutative and by induction and definition, G^/G^ I + 1 ^ is also commutative. 
Thus we have (b) by taking G k = G (k) . Conversely, assume we have a descend¬ 
ing sequence 

= >G„ = {e} 

with Gi/G i+1 commutative. Then G/G 1 being commutative implies 
xyx 1 y 1 G 1 — eGi which yields G x G* 1;> . Now assume 

G* => G (fc > 

Then since GJG k + 1 is commutative we see 

O k+1 => (G & ) (1) => (G (k) , G (fc) ) = G (fc+1) . 

However, since G n = {e} we see G (n) = {e} which gives (a). 

Corollary 10.3 (a) A subgroup H of a solvable group G is solvable. 

(b) If G is a solvable group of length n and H a normal subgroup, then 
G/H is a solvable group of length less than or equal to n. 

(c) If G is a group and H is a normal solvable subgroup of length n such 
that G/H is solvable of length tn, then G is solvable of length less than or equal 
to n + m. 

Proof (a) We just note that by induction <= G (A) . 

(b) Let G = G/H. Then by induction we see that (G) (fc) = G (k) using 
71 : G G/H = x -+x = xHi s a homomorphism. Thus the series for G yields 
the series G>G (1) > iii >G (B) — {eH}. 
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(c) Note that from the series Gt>G (1) c>--* t >G (n) = {e//} we obtain 

Gc>G (1) t=-t>G (n) and G (w) <= H . However, since H is solvable we have 

• • • t>{e} and we put these two series together to see that 

G is solvable. 

Definition 10.4 Let G be a Lie group. Then G is a solvable Lie group if G 
is solvable as an abstract group. 

Theorem 10.5 Let G be a Lie group. Then the following are equivalent. 

(a) The Lie group G is solvable. 

(b) There exists a finite sequence of subgroups, 

G s= G 0 o G x o- G 2 o * * * c> G n = {e} 

such that each G k is a closed Lie subgroup of G with G k jG k+l commutative 
for k = 0, 1 ,..., n — 1 . 

(c) There exists a finite sequence of closed Lie subgroups 

G = G 0 e> Gi o * • • t> G r = {e} 

such that for k = 0,..., r — 1, we have G k /G k+i is a connected one-dimensional 
group or a discrete group. 

Proof To show (a) implies (b), we recall that if H is a normal subgroup 
of G, then its closure H is a closed normal subgroup of G. Next assume G is 
solvable so we obtain the series 

G>G (1) >G (2) -->G (m) = {e}, 

and let 

G 0 = G and G k = G®. 

Then the G fc are closed normal subgroups (and, therefore, Lie subgroups) 
such that G 0 c> Gj o- • • • o G m = {c}. Furthermore G k jG k+1 is a Lie group which 
is commutative, for let n : G-^G/G k+1 . Then since n(G (k) ) is commutative, 
we have 

n(G ik) ) 

is commutative. However, since n is continuous, 

n(G k ) = n{G^) c tt(G<^) 

so that G k /G k+1 = n(G k ) is commutative. 

The converse (b) implies (a) is clear. Also (c) implies (b) is clear, so it 
remains to show (b) implies (c). Thus let H k be the connected component of 
the commutative Lie group GJG k+1 . Then by the results outlined in the 
exercise (1), Section 6.5, we have for the G k in (b) 

GklG k+ i = H k x D k 

where D k is a discrete group, and H k = R q(k} x T p(k) by Theorem 6.20. 
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Next let n:G k ^> G k /G k+1 . Then in the series for (b) we replace each of the 
terms G k by the series [with p = p{k), q = q(k)] 

T9 x D k )^n~\R p - 1 xT q x D k )t> ■ ■ ■ t>n~\T q x D k ) 

071 1 (T q ~ 1 x D k )t>~ ■ ■ ■ 07 I -1 (Z> 4 ). 

Thus we obtain the series in (c). 

Exercise (1) Let V be a real vector space of dimension m and regard 
GL(V) as the set of all nonsingular mxm real matrices. Let H be the subset 
of all matrices of GL(V ) of the form 

a il * 

0 

where a\ u ^ 0 and * arbitrary real numbers. Show H is a solvable Lie subgroup 
of GL(V). 


2. Solvable Lie Algebras and Radicals 


Let g be a finite-dimensional Lie algebra over a field K and let h, k be 
subspaces of g. Then we shall use the following notation. 

(1) We denote by [hk] the subspace of g generated by all products [jcy] 
for x e h and y e k. In particular, g n> = [gg] is a subalgebra of g. 

(2) If h is an ideal of g, then we write g >-/? or h<i g. In particular, note 
gc=~g (1) . A Lie algebra g is abelian or commutative if g (1) = {0}. 

Definition 10.6 Let g be a finite-dimensional Lie algebra over K, set 
ff (1) = Iffffl and define by induction 

g (k+1) = [g m g (k) l 

From the Jacobi identity for g we obtain 

£t>0 (1) C=>0< 2 >D- 

and we call g solvable if there exists n with g {n) = {0}. The smallest such n is 
called the length of the solvable algebra g. 





2 . SOLVABLE LIE ALGEBRAS AND RADICALS 


205 


Theorem 10.7 Let g be a finite-dimensional Lie algebra over K. Then 
the following are equivalent. 

(a) The algebra g is solvable. 

(b) There exists a sequence of subalgebras g = g§^9 \* * ' ^9r = W 
so that the quotient algebra g k lg k +i is commutative. Each g k can be taken 
to be an ideal in g. 

(c) There exists a finite sequence of subalgebras g = 9o^9\^ '** ^ 
g s = {0} such that dim g k /g k+i is 1. In general g k is not an ideal in g but only 

in g k ~\- 

Proof The equivalence of (a) and (b) is similar to those for groups in 
Section 10.1. Thus, for example, if g is solvable, then take g k = g (k) for k = 1, 
r to obtain the sequence in (b) and also note [g {k) g {k) ] = so the 

desired quotient algebra is commutative. _ 

Next assume (c) where we have gjg k + 1 = KX =KX + g k +i since g k lg k + % 
is one dimensional. Then since [XX] = 0 we have gjg k + 1 is a commutative 
Lie algebra. Thus (c) implies (b). Conversely, if the sequence in (b) is such 
that gjg k+1 =KX x 4- * * * +KX r + g k+1 is commutative, then each subspace 
H(i)=KX l + --+KX i +g k+l is an ideal in g k /g k+1 for r. 

Thus the corresponding subspace h(i) generated by {X u ..., XHyg k +u where 
X t + g k +t=X i eg k lg k + 1 , is an ideal in g k . Thus we obtain a sequence 

g k = h r c^h r -i o* * * * £>g k + x 

so that the quotient ideals are one dimensional and this yields (c). 

The proof of the following is similar to Corollary 10.3. 

Corollary 10.8 Let g be a Lie algebra containing the Lie subalgebra h. 

(a) If g is solvable, then h is solvable. 

(b) If g is solvable and h an ideal of g y then gjh is solvable of length 

less than or equal to the length of g . 

(c) If A is a solvable ideal of g such that gjh is solvable, then g is solvable. 

Exercises (1) Let g denote the set of m x m matrices of the form 


a 22 


0 
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where are arbitrary in K . Show g is a solvable Lie subalgebra of gl(V ). 

(2) Prove the following isomorphism theorems for Lie algebras. 

(i) Let /: g g be a homomorphism of the Lie algebra g onto the Lie 

algebra g and let k = ker(/). Then h is an ideal of g if and only if the 
inverse image = h is an ideal of g such that h => k. If this is the case, 

we have gjh s g/h ; that is, gjh s ( g/k)l(h/k ). 

(ii) Let h and k be ideals of the Lie algebra # and let f\g^>gjk be 
the natural homomorphism. Then h + k — f~ 1 {f(h)) and {h + k)/k ^ hjh n k. 

(iii) Also show that the sum h + k and product [hk] of the ideals h 
and k are again ideals of g. 

(3) Let g be a solvable Lie algebra of dimension n over K. Show that g 
is a semidirect sum of an ideal h of dimension n — 1 and a one-dimensional 
subalgebra. 

(4) Show that if g is a solvable Lie algebra over K , then P® K g is a 
solvable Lie algebra over the algebraic closure P of K. 

Theorem 10.9 Let G be a Lie group with Lie algebra g . 

(a) If G is solvable, then g is solvable. 

(b) If G is connected and g is solvable, then G is solvable. 

Proof (a) If G is a solvable Lie group, then we have a sequence 
G = G 0 >G 1 >’ ,< >G n = {e} with each G k a closed normal Lie subgroup 
so that G k /G k + 1 is commutative. Then we obtain the corresponding sequence 
g = 9o^9i g n = { 0 } of ideals of g so that g h lg h+1 is a commutative 

Lie algebra. 

(b) If G is connected and g is solvable, then we shall show G is solvable 
by induction on the length of g. Thus let • • -og (n ~ 1) t>g (n) = {0} 

be the sequence for g and let K be the Lie subgroup of G generated by 
exp g (n ' l \ Then K is a commutative normal subgroup of G (since 0 (n ~ 1) is 
a commutative ideal of <?) and its closured = H is also a commutative normal 
Lie subgroup of G. Now let h be the Lie algebra of H. Then h is a commutative 
ideal of g and c= h. From this we have [exercise (2) above], g!h = 

W- 1 ) )/(W" 1> ) and since ^"“^is solvable of length less thanorequalto 
n — 1 we have gjh is solvable of length less than or equal to n — 1 (Corollary 
10. 8 ).Thus by the induction hypotheses G/H is solvable and since H is solv¬ 
able, we have G is solvable using Corollary 10.3. 

Exercise (5) If G is a connected solvable Lie group, then there exists 
a sequence G = 6 0 >G 1 t>-** t> G m = {e} where all the G k are closed connect¬ 
ed Lie subgroups such that GJG k + l are one dimensional. In particular, G 
contains a connected solvable normal subgroup H with dim H = dim G — 1. 



2. SOLVABLE LIE ALGEBRAS AND RADICALS 


207 


Lemma 10.10 Let g be a finite-dimensional Lie algebra over K. Then 
there exists a unique maximal solvable ideal of g : namely the sum of the 
solvable ideals of g. This maximal solvable ideal is called the radical of 0 
and is denoted by r. Moreover g\r is {0} or contains no proper solvable ideals; 
that is, the radical of gjr is {0}. 


Prooe Let h and k be solvable ideals of tf.Then the vector subspace > h + k 
is an ideal of g. Now by the above exercise (2) we see (h + k)jk s hj(hr>k) 
and since hnkczh is solvable we have h/(h u k ) is solvable. Thus we have 
(h + k)lk is solvable and k is solvable so that by Corollary 10 8, h + k is 
solvable. Thus since g is finite dimensional, the solvable ideal of maximum 
dimension is unique and by the above, contains every solvable ideal of g; 

denote this maximal solvable ideal by r. . 

Next let h = hjr be a solvable ideal of g = gjr where h is some ideal ot g 
with h => r. Then since hjr is solvable and r is solvable we have by Corollary 
10.8 that h is solvable. Thus he: r, so that R = {0}. 


Definition 10.11 Let G be a Lie group with Lie algebra g and let r be 
the radical of g. Then we define the radical of G, R = rad G, to be the connected 
Lie subgroup of G whose Lie algebra is r = rad g . 

Proposition 10.12 Let G be a Lie group with radical R. Then R is closed 
and R is the maximal solvable normal connected Lie subgroup of G. 


Proof Let R denote the closure of R. Then R is a normal, solvable Lie 
subgroup (since it is closed). Thus its Lie algebra r is solvable (Theorem 10.9) 
so that r = P and consequently R = R ; that is, R is a closed, normal, solvable 
Lie subgroup of G. The fact that R is maximal among connected Lie subgroups 
with these properties also uses the maximality of r. 


Corollary 10.13 The radical of GjR equals {eR}. 


Exercise (6) Show that the radical of a Lie algebra g is the smallest 
ideal h of g such that the radical of gjh is {0}; that is, if h satisfies this condi¬ 
tion, then r <=. h. 


Definition 10.14 (a) A finite-dimensional Lie algebra is called semi- 

simple if it has no proper solvable ideals. Thus g is semisimple if and only if 
r = {0}. Similarly a Lie group G is semisimple if its radical R - (e|. 

(b) A Lie group G is simple if its Lie algebra g is simple. That is, 
[ gg ] / {0} and g has no proper ideals. 
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We shall eventually show that a semisimple Lie algebra over a field of 
characteristic 0 is a direct sum of simple Lie algebras which are ideals. Con¬ 
sequently many problems involving semisimple Lie groups can be done in 
terms of simple Lie algebras. 

Exercise (7) Show that the center of a simple Lie group is discrete (note 
Section 6.5 and Corollary 7.14). 


3. Lie’s Theorem on Solvability 


We now describe how a solvable Lie group or Lie algebra of endomor- 
phisms can be represented by triangular matrices. To do this we must com¬ 
pute characteristic roots so we consider real Lie groups or algebras as acting 
on complex vector spaces. 

Definitions 10.15 (a) Let K be a field of characteristic 0 and let V be 
a finite-dimensional vector space over K. Let Te End K (F) and X e K. Then set 

V x = {X e V: TX = XX} 

and 

V(X) = {X e V: (T— XI) n X = 0 for some n e N} 

where N is the set of natural numbers (which are greater than 0). If V x ^ {0}, 
then X is called a characteristic value or eigenvalue of Tand 0 # X e V A is called 
an eigenvector or characteristic vector of T with characteristic value X. If 
V(X) # {0}, then X is called a weight of T and V(X) a weight space and 0 # 
X e V(X) is called a weight vector of T. 

A characteristic value or a weight X of T is a solution of the equation 
det(Zx — T) = 0 and if all the solutions to this (characteristic) equation are in 
K, then we say that the characteristic values or weights are in K; recall the 
definition of a split endomorphism in Section 9.2. 

(b) Let N <= End x (K), let /: N ->K be a function, and set 

V f = {X € V: for all Te N, TX =f(T)X} 

and 

V (./) = {X e V: for all Te N, there exists n > 0 with (T-f(T)I)"X = 0). 

If V { j* {0},then/is called a characteristic function on A and 0 #Xe V f is called 
a characteristic vector of N for the characteristic function /. Similarly one 
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defines a weight function, weight space, and weight vector in case V(f ) # {0}. 

Thus these functions on N assign to each Tin N a characteristic root/(T) 
of T. Of course, in actual computations, the characteristic roots discussed 
above might be in the algebraic closure of K. 

With these definitions and results on canonical forms of endomorphisms 
[Jacobson, 1953, Vol. II; Lang, 1965] we state the following: 

Proposition 10.16 Let F be a finite-dimensional vector space over K 

and let T e End K (F) have its (distinct) weights .A m in AT. Then the weight 

spaces F(A ( ) are T-invariant and F= F(A t ) + ••• + F(AJ (direct sum). 

Remark (1) This direct sum decomposition will be generalized in the 
next chapter to a direct sum decomposition of weight spaces of a nilpotent 
Lie group or Lie algebra. 

Exercise (1) Let V be a finite-dimensional vector space over K , let g 
be a Lie subalgebra of gl(V), and let /: g^K be a characteristic function on 
g. Show/is a linear transformation. 

Proposition 10.17 Let F be a finite-dimensional vector space over R, 
let G be a Lie subgroup of GL(F), and let /: G -► R be a characteristic function 
with f{G) <= R* = R - {0}. Then regarding R* as a multiplicative Lie group, 
the map /: G^R* is an analytic homomorphism of Lie groups. / is fre¬ 
quently called a character of G. 

Proof Let S, TeG. Then for 0j=XeV f we have SX =f(S)X and 
TX = f(T)X. Thus 

STX = Sf(T)X =f(T)SX = f(T)f(S)X. 

However since ( ST)X =f(ST)X this gives/(ST) =/(S)/(T) so that/: G 
R* is a homomorphism. To see that / is analytic, let X lt . • •, X m be a basis 
of Fso that X t is a characteristic vector of G for the characteristic function/. 
Noting that the mappings r : G -» V : S -* SiXj and s : V-> R : i i i 

are analytic, so is the map/= s ° r : G -*• R*. 

Analogous to Lemma 7.15 we have the following result: 

Lemma 10.18 Let V be a finite-dimensional vector space over R and let G 
be a real connected Lie group which is a subgroup of GL(V) and has real Lie 
algebra g. Let IF be a subspace of V. 
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(a) W is invariant under the action of G if and only if W is invariant 
under the action of g. 

(b) For A eg, the vector X e V is a characteristic vector of A with 
characteristic value A if and only if X is characteristic vector of the subgroup 
{exp tA : t e R} for the characteristic function f: exp tA —* e a . 

Exercise (2) Prove results analogous to Proposition 10.17 and Lemma 
10.18 when we take V to be a finite-dimensional vector space over C and let G 
be a real Lie group which is a subgroup of GL(V, C). For example, consider 
the real matrix Lie group G = GL(n, R ) as acting on C" and regard G as a 
subgroup of GL(n, C). 

The following result or some of its equivalent consequences is known as 
“Lie’s theorem on solvability.” We follow the work of Tits [1965] for the 
group proof. 

Theorem 10.19 (Lie’s theorem) Let V be a finite-dimensional vector 
space over C and let G be a real connected solvable Lie group which is a 
subgroup of GL(V, C). Then there exists a nonzero characteristic vector of G 
for some characteristic function. 

Proof We shall prove the results by induction on the dimension of G. 
First, if G is one dimensional and 0 # A eg which is the Lie algebra of G, 
then since g <= gl(V) we see that A has a nonzero characteristic vector X e V. 
However, by Lemma 10.18 and exercise (2), X is also a characteristic vector of 
G. Next assume G is of dimension n and assume as an induction hypothesis 
that we have shown the result for all such groups of smaller dimension. Now 
since G is connected and solvable, G has a connected solvable normal subgroup 
H of dimension n - 1 [exercise (5), Section 10.2], Thus by the induction 
hypothesis we can conclude there is a characteristic function f'.H —> C* = 
C - {0} and analogous to Proposition 10.17 we have/is continuous. 

We shall now show that the subspace V f = {Xe V: SX =f(S)X for 
all S e H} is invariant under G. Thus let X eV f ,S eH, and Te G. Then 

S(T(X)) = (ST)(X) = T(T-'ST)(X) =f(T- l ST)T(X) (*) 

using T l STe H. Thus the number f(T 'ST) is a characteristic value of S 
with characteristic vector T(X ) and also the function k:G-*C*:T-> 
f(T l ST) is continuous. However, since G is connected and the set of 
characteristic values of S is discrete, the image k(G) consists of a single point. 
(This uses the characterization: the topological space M is connected if and 
only if M is mapped continuously into a discrete space implies the image 
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of M consists of a single point.) Thus we have k(T) = k(l) =f(S). Using this 
in (*) we have for any X e V f , TeG, and S e H that S(T(X )) = /(S)T(X) 
which shows by the definition of V f that V f is invariant under the action of G. 

Next by Lemma 10.18 we have that V f is invariant under the action of 
g <r gl(V). Therefore if A eg and A <£h which is the Lie algebra of H, then, 
since the subspace V f is invariant under the linear transformation A, there is 
a characteristic vector 0/Iel) for A. Thus since g\h — RA + h (using the 
hypothesis that dim H is n - 1) we see that X is a characteristic vector for g. 
For let B = aA + bC e g with C eh and for AX = AX e V f and CX = gX 
[using Lemma 10.18(b) applied to h and H] we have 

BX = (aA + bC)X = (aX + bg)X. 

Thus by Lemma 10.18, X is a characteristic vector for G. 

Definition 10.20 Let V be an w-dimensional vector space over the field 
K Then a sequence of subspaces {0} c V y <= V 2 <=■ • • • = V m = V such t ^ at 
dim V { = i for i = ,m is called a flag in V. Let G c GL(V) be a Lie group. 
Then the flag is G-invariant if for every T e G we have T(F ; ) <= V { for i=l,..., 
m. Similarly for a Lie algebra g of endomorphisms, we define a ^-invariant 

flag. 

Proposition 10.21 Let V be an /^dimensional vector space over C and 
let G be a real connected Lie group which is a subgroup of GL(V.C). Then 
the following are equivalent. 

(a) The group G is solvable. 

(b) There exists a flag which is G-invariant. 

(c) There is a basis of V such that the matrices for the elements in G 
can be put simultaneously into triangular form. (The matrices might have 
complex entries). 

Proof Assume G is solvable. Then to show (b) we use induction on the 
dimension of V. From Lie’s theorem there is a one-dimensional subspace W 
of V which is invariant under G. Therefore an element TeG induces a non¬ 
singular linear map 

T : V/W-> VIW: x + W -* Tx + W 

and the map G - GL(V\ W, C ): T-* Tis an analytic homomorphism. Thus the 
image G = {Te GL(VjW, C): TeG} is a real connected solvable Lie group 
which is a subgroup of GL(V/W, C ) and by the induction hypothesis there 
exists a flag in Vj W which is invariant under G 

{0}c: V 2 cz V 3 <=•••<= V = V/W. 
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Now let 7i : V-*V/W, let V, = n~\Vi), and set = W= 7i _1 ({0}). Then 
dim V t = i and {0} <= V 1 <= V 2 <= • • ■ c V„ = V is a flag which is invariant 
under G. 

Next to show (b) implies (c) we choose a basis of V from the corresponding 
flag as follows. Let V l = {XJ. Then since TV X c V t for all TeG we have 
TX t = a n (T)Xi. Next let V 2 = {X u X 2 ) where X t and X 2 are independent 
using dim V 2 /V l — 1. Then since TV 2 aV 2 we have TX 2 = a l2 (T)X l 
+ a 2 i(T)X 2 for all TeG . Continuing in this manner we can choose a 
basis of V so that any TeG has a matrix of the form 

"«n (T) a 12 (T) a ln (TY 

a 22(T) 

. 0 ” ann{T)_ 

with 0 # a n (T) * * • a nn (T) = det T. 

Finally to show (c) implies (a) let G be represented by the group triangular 
matrices as above. Let G 1 be the normal subgroup of triangular matrices of 
the form 

P.. • l 


1.0 ••• 0 lj 

with l’s on the diagonal. Let G 2 be the normal subgroup of G x of the form 

"10 * 

1 0 
1 

. 0 

-° 1 . 

with l’s on the diagonal and 0’s on the next superdiagonal. Continuing 
this way we obtain the sequence 

G>Gi>G 2 t->G„ = {/} 

with GJG i+1 commutative. 

The preceding results on Lie groups can be translated into results on 
Lie algebras via the exp mapping or directly as follows. This proof involves 
some computations we shall see again in Chapter 11. 

Theorem 10.22 (Lie’s theorem) Let P be the algebraic closure of the 
field K and let V be a nonzero finite-dimensional vector space over P. Let g 
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be a solvable Lie algebra over K and let p be a homomorphism of g into 
gl(V, P). Then there exists a vector 0 e V which is a characteristic vec or 
for all the members of p(g ) for some characteristic function. 

Proof We prove this by induction on the dimension of g. For dim #=1, 
the theorem follows from results on canonical forms (Proposition 10.16). 
We assume the results hold for Lie algebras of dimension less than dim g. 
From Theorem 10.7(d) we can find an ideal h in g so that dim g\h - L By 
Corollary 10.8 we have h is solvable so that by the induction assumption there 
exists a characteristic function /: h -* P so that for all S e h 

p{S)X =f{S)X. 

From dim gjh — 1 we can find Teg so that T$ h. Thus g - KT + h. Let W 
be the subspace of V spanned by all the vectors 

X t =X and X k+i = p(T) k X 

for k = 1 2,... . Note that W is p(T)~invariant subspace of V. 

We shall now show: For all S e h, p{S)Y =f(SW for all Y e W; that is, 
IF is p(/i)-invariant and furthermore p(S) =f(S)l on W. 

We first prove by induction that for all S e h and k = 1,2,... 

p(S)X k =f(S)X k + a k -iX k -! + • • ■ + a r Xy (*) 

where a J = a/S) are in P. By the choice of X, =X the result holds for L = 1. 
Assuming (*) for k , we have 


p(S)X k+i = p(S)p(T)X k , definition of X k+1 

= p([ST])X k + p(T)p(S)X k 
= p([ST])X k + p(T)(f{S)X k + a k . k X k 
=f(S)p(T)X k + b k X k + ’'' + 

=/(S)X*+1 + b k x k + • ■ • +b l x 1 


+ ■ ■ ■ + o k X i) 


using 1ST] 6 h and the induction assumption. 

We next prove p(S)Y=f{S)Y for all Ye W. From (*) and the definition 
of X k we first observe that W is p(#)-invariant. Next note that from the above, 
the restriction p(S) | W has matrix 


/(S) 


m 


so that tr p(S) I W^f(S) dim W for S e h. Next note p(S) and p(T) map W 
ZoVJPto* MIST,UlWD-KM as endomorphisms of W. 
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Thus since tr (AB) = tr (BA) for endomorphisms, we see 0 = tr p([ST]) = 
/([ST]) dim W. Since dim W > 1 this gives f([ST]) = 0. Thus 

p(S)X k+1 = p(S)p(T)X k 

= P([ST1)^ + p(T)p(S)X k 
=f([ST])X k +f(S)p(T)X k =f(S)X k+l 
that is, p(S)Y =f(S)Y for all Y e W. 

Since W is p(7>invariant and P is algebraically closed we see that p(T) 
has a characteristic vector A e W:p{T)A = tA. Also p(S)A =f{S)A for all 
S eh and since g =KT+ h we have for anyZ = aT+ S that p(Z)A = ap(T)A 
+ P(S)A = (at +/(S))A. Thus A is a characteristic vector of p(g) and 
F: aT + S -> at +/(S) defines the corresponding characteristic function. 

The formalities in the proof of Proposition 10.21 yield the following: 

Proposition 10.23 Let P be the algebraic closure of the field K and let V 
be a nonzero finite-dimensional vector space over P. Let g be a Lie algebra 
over K and let p be a homomorphism of g into gl(V, P). Then the following 
are equivalent. 

(a) The Lie algebra p(g) is solvable, 

(b) There is a flag in V which is invariant under p(g). 

(c) There is a basis of V such that the matrices for the endomorphisms 
in p(g) can be put simultaneously into triangular form. (The matrices might 
have entries from P.) 

These results apply when we take the field K to be algebraically closed 
itself. Thus K = P and we obtain the following: 

Proposition 10.24 Let g be a Lie algebra over the algebraically closed 
field K. Then g is solvable if and only if there exists a flag in g 
{0} <= g k <=g 2 <= ••• <=g n =g 
such that each g ( is an ideal of g. 

proof Assume g is solvable. Then since g -► ad (g) :I-»ad I is a 
homomorphism of Lie algebras over K, we see that ad(^) is a solvable Lie 
algebra of endomorphisms acting on the vector space g. By Proposition 
10.23(b) there is a flag {0} <= g 1 <=••• <=g n =g which is invariant under ad(#); 
that is, each g t is an ideal of g. 

Conversely, assuming such a flag exists we see that ad (g) is solvable; 
using (b) implies (a) in Proposition 10.23. However. ad:g->ad(g) is a 
homomorphism so that ad(<?) s ^/ker(ad). Since ker(ad) is the center of g 
which is solvable and since gj ker(ad) is solvable, we have by Corollary 10.8 
that g is solvable. 




CHAPTER 11 


NILPOTENT LIE GROUPS 
AND ALGEBRAS 


We continue the concepts given in the preceding chapter and call a Lie group 
nilpotent if it is nilpotent as an abstract group. Then we discuss nilpotent 
Lie algebras and obtain the result that a connected Lie group is nilpotent if 
and only if its Lie algebra is nilpotent. In the last section we consider the 
vector space decomposition which yields the Jordan canonical form for an 
endomorphism and extend this decomposition to a nilpotent group of auto¬ 
morphisms. 


1. Nilpotent Lie Groups 


We now give a variation of the results on solvable groups using some of 
the notation of the preceding chapter. 

Definition 11.1 Let G be an abstract group. 

(a) Let C°G = G and let C n+1 G = (G, C n G). Then C"G >C +, G and we 
have the descending central series 

G = C°G >C'G >C 2 G o •. 

(b) Let C 0 G ={*} and let C n G = n~\Z(GIC n -,G)), where Z(GIC n ^G) 
is the center of G/C„_,C noting by induction C„G-a C„ + 1 G<i G and where 
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n : G -> G/C w _xG is the corresponding projection map. Thus we have the 

ascending central series 

{e} = C 0 G<] QGo C 2 G<i *. 


Theorem 11.2 Let G be an abstract group. Then the following are 
equivalent. 

(a) There is a series of normal subgroups of G 

G — G 0 o G 1 • • • o G s = {e} 

such that (G, G„) c= G„ + 1 for n = 0, ..., .? — 1. 

(b) There exists a positive integer p such that 

Gv>C l G C p G = {e}. 

(c) There exists a positive integer q such that 

• • -o C^G = G. 

Proof Assume there is a series as given in (a). Then by induction we 
have G n => C"G. Thus C S G = {e}. Conversely if (b) holds, then we automati¬ 
cally have a series satisfying (a). 

Next we have (a) implies (c), for if we have a series as in (a), then we shall 
show by induction G s _„ c C„G so that for n ~ s we obtain C S G = G. Thus 
{e} = G s c C 0 G ~ {e} and assume G s _ f <= C*G Then 

(G/C f G, G s _ i . 1 /C i G) c= G s _ f /C f G <= C f G/C t G = {e} 

using the induction hypothesis for the second inclusion; that is, 
(G, cz C £ G. Thus if 7r:G-»G/C/G is the projection, we see that 

Gg-i^i c n~ 1 (Z(G!C i G )) — C i+1 G , using the definition of C f+1 G. 

Conversely to see (c) implies (a), we first note that 

(G, CiGyC^G c (G/Cj-jG, QG/C^G) = {e} 

using C t G — n~ 1 (Z(G/C i _ 1 G)), where n: GG/C^G. Thus (G,C f G)c: 
C^jG so that for C^G = G = G 0 , C^jG = Gj,..., C q _ m G = G m , etc., we 
see that the series in (c) yields the series in (a). 

Definition 11.3 An abstract group G is nilpotent if it satisfies any one of 
the conditions of Theorem 11.2. 

Remarks (1) Note that nilpotency involves a descending series using 
commutators of the terms of the series with the group, whereas solvability 
involves a descending series using commutators of the terms of the series with 
itself. 
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(2) Subgroups, quotient groups, and finite direct products of nilpotent 
groups are nilpotent. The proofs run as expected. For example, if G f are 
groups with C^G* = {e,-} for i = 1,..., m and if G = G x x • • * x G m , then 
C n G = {e} 7 where n = max{«!,..., n m }. 

Example (1) Let V be an m-dimensional vector space over K and let 
G c= GL(V) be the set of triangular m x m matrices of the form 

~ a 

a * 

_0 a m 

for a # 0 in K. Then G is nilpotent group, for let G t be the subgroup of 
matrices of the form 

n t i 


[o --tl 

and let G 2 be the subgroup of matrices of the form 

"10 0 *" 

1 0 

0 

* . 0 

_0 1 . 

Then G oGj t=-G 2 ■ >G m _ 1 — {/} and (G, G n ) <= G n + 1 . 

Theorem 11.4 Let G be a Lie group. Then the following are equivalent. 

(a) As an abstract group G is nilpotent. 

(b) There exists a series G = G 0 G x o- * • • G s = {^} where each G n is 
a closed normal Lie subgroup of G and (G, GJcG„ +1 . 

(c) If C°G = G and C" +1 G = (G, C"G), then there exists a positive integer 

p such that G t>C 1 G --t>-C p G — 

Proof Showing (c) if and only if (b) is similar to Theorem 11.2; (b) 
implies (a) is also clear. Next assume (a). Then there is a series of normal 
subgroups G ^G x o* • • t>G s = {e} with (G, G n ) c G n + 1 . Consequently we 
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obtain G s = {e}, using G as Hausdorff, and G>S 1 > ,, *>G s = {e} with 
(G, G n ) cz G n+1 which proves (b). 

Definition 11.5 A Lie group G is a nilpotent Lie group if it is nilpotent as 
an abstract group. 

Exercises (1) Let G be a Lie group. Show that C P G = {e} if and only if 
C P G = {e} if and only if C p G = G. 

(2) Show that if G is a nilpotent Lie group and H is a closed normal 
subgroup of G, then G/H is a nilpotent Lie group. 

(3) Let G be a Lie group and H a nilpotent subgroup. Then show its 
closure H is nilpotent. 


2. NOpotent Lie Algebras 

Let G be a Lie group with Lie algebra g. We shall now define the notion 
of a nilpotent Lie algebra so that if G is connected, then G is a nilpotent Lie 
group if and only if g is a nilpotent Lie algebra. 

Definition 11.6 (a) Let g be a Lie algebra over a field K and let C Q g — g 
and C tt+1 g = [g C n g]. Thus we see that 

C x g = [gg] 9 •••> C k g = (ad g)\g), ... 

are ideals of g and we obtain the descending central series g = C°g C x g o- • • •, 

(b) Set C 0 g = {0} and C n+1 g = n~ x (Z(g/C n g)), where by induction 
9 and n : 9 ~ ¥ 9l^n9 ^ the Lie algebra homomorphism and Z(g/C n g) 
is the center of the Lie algebra g!C n g . Thus we see that C 0 g ~ {0}, C x g = Z(g ), 
etc. are ideals of g and we obtain the ascending central series 
{0} = C 0 g<i •**. 

Theorem 11.7 Let g be a Lie algebra over K. Then the following are 
equivalent. 

(a) There exists a sequence g —g 0 o--* og s = {0} where all the 

g n are ideals of g such that [gg n ] <= g n + x . 

(b) There exists a positive integer p such that g = 
C°g t^C x g c-- • • t>C p g = {0}. 

(c) There exists a positive integer q such that {0} = 

C 0 g <3 C x g <i ***<3 C q g =g. 
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(d) There exists a positive integer r such that for all X t ,..., X r e g we 
have ad X k ° ad X 2 ° • • * ° ad X r = 0. 

Proof The equivalence of (aH c ) are similar to Theorem 11.2. To see 
(b) if and only if (d) just use the fact that C k g is generated by the elements 
(ad Xi o ad X 2 o • • • o a d X k ) Y for any X t ,..., X k , Y e g ; see Definition 11.6. 

Definition 11.8 A Lie algebra g is called nilpotent if it satisfies any one of 
the conditions of Theorem 11.7. 

Remarks (1) Subalgebras, quotient algebras, and finite direct sums of 
nilpotent Lie algebras are again nilpotent. 

(2) A nilpotent Lie algebra is solvable, for by induction we obtain 

CV= , 0 < " +1) - 

Exercises (1) What can be said about the following: Let g be a Lie 
algebra with h an ideal of g so that gjh is nilpotent and h is nilpotent. Then g 
is nilpotent ? 

(2) Let g denote the set ofmxra matrices with elements in K of the form 



where a u = a 22 = • • • = a mm . Then g is a nilpotent Lie algebra, for let g k be 
the subalgebra of matrices of the form 



and let g 2 be the subalgebra of matrices of the form 



Show that g t>g 2 c> "' — {0} an< ^ \-99iS c 9n+i • 
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Proposition 11.9 Let g be a Lie algebra over K. Then g is solvable if and 
only if [gg] is nilpotent. 

Proof Suppose [gg] is nilpotent. Then [gg] is solvable. Also since g/[gg] 
is a commutative Lie algebra, it is solvable. Thus by Corollary 10.8, g is 
solvable. 

Conversely, first let P be the algebraic closure of K and let g be a solvable 
Lie algebra over P contained in gl( V , P), where Kis a finite-dimensional vector 
space over P. Then by the results following Lie’s theorem (Proposition 10.23) 
there is a basis of V so that the matrices of g have triangular form 

~a xx * " 

a 22 

. 0 a mm _ 

and consequently the matrices for elements in [gg] have the form 

r°o i 


Lo ' oj. 

Thus by exercise (2), [gg] is a nilpotent Lie algebra. 

Next if <7 is an arbitrary solvable Lie algebra over P, then ad {g) is solvable 
and therefore [ad(^), ad(#)] = ad([^]) is nilpotent. However, since ad : g -* 
a d (g) is a Lie algebra homomorphism with ker(ad) = Z(g) we see that 
9 = 9t z (g) = ad (g). Therefore g (2> = [gg] £ ad([^]) is nilpotent. Con¬ 
sequently, there exists a positive integer p such that 

{ 0 } = C p+l g™ = C p+1 g™lZ(g). 

Thus C p+1 g ( - 2) <= Z(g) so that C p+2 g (2) = { 0 }; that is, g (2) = [gg] is nilpotent. 

Finally, if g is a Lie algebra over K, we let h = P ® k g be the tensor pro¬ 
duct of the algebras P and g over K as in Section 9.1. Then h is a Lie algebra 
over P and a straightforward computation shows that if g is solvable, then 
h is solvable. Thus since [gg] e [hh\ we use the results of the preceding para¬ 
graph to conclude [gg] is nilpotent. 

Exercise (3) If g is a nilpotent Lie algebra over R, then show that its 
complexification g c is also nilpotent. 
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Theorem 11.10 (Engel’s theorem) Let V be a nonzero finite-dimen¬ 
sions! vector space over the field K and let g be a Lie subalgebra of gl( V) which 
consists of nilpotent linear transformations (that is, A" = 0 for some ri). Then 
there exists a nonzero vector Xe V such that for all A eg, we have AX = 0. 


Proof First we shall show that A e g being a nilpotent linear transforma¬ 
tion implies ad g A is a nilpotent linear transformation acting on g. Thus since 
gl(V) = End(K) as sets, we can define the endomorphisms 

R(A) : gl(V) -► gl(V) \Z^ZA and L(A) : gl(V) -* gl(V): Z -* AZ 


and see (ad A)(Z) = AZ - ZA = (L(A) - R(A))(Z) in gl(V). Also noting that 
L(^4 )jR(^4) = R(A)UA) we have by the binomial theorem for any integer 
k^O, 

(ad AfZ = [L(A) - R(A)] k Z 


I 


k 

= 1 
f = 0 


(_1 y^A k - l ZA‘. 


However, since A € g is nilpotent, all the factors A k ' or A are 0 for suitably 
large k or i. Thus ad g A is nilpotent. 

Next we shall use induction on tn = dim g to prove the result. For m = 1 
we have g = KA where A is a nonzero nilpotent linear transformation. Thus 
since there exists Xe V with AX = 0, the same result holds for every B = 
g Now assume as an induction hypothesis that the result holds for Lie 
algebras of dimension less than m and let Abea proper subalgebra of g of 
maximum dimension. Then by the results of the above paragraph, ad g A is 
a nilpotent endomorphism on g for all Ash. Thus since ad A . h —* h we see 
ad A induces a nilpotent endomorphism A on the vector space g = g\h. 
Furthermore the set h = {A : A s h) i_s a subalgebra of gl(g) which consists of 
nilpotent endomorphisms and dim h <m. 

By the induction hypothesis with V=g we can conclude that there exists 
B # 0 in g such that for all A s R we have AB = 0; that is, there exists Beg 
with B $ h and [h, B] <= h. Thus the subspace h + .Oof# is a subalgebra which 
contains h. However, by the maximal choice of h we have h + KB =g. 

Finally let W = {Z s V : AZ = 0 for all A s h}. Then by the above induc¬ 
tion hypothesis W A {0}. Furthermore for Ash and Beg as above we have, 
since [A, B] e h, that for any ZeW, 


A(BZ) = (AB)Z = [A, B]Z + (BA)Z = 0. 

Thus by the definition of W we obtain BWcz W. However, since Beg is 
nilpotent on V we have B is nilpotent on W. Consequently there exists 
0 # Xe W with BX = 0 and since g = h + KB we see this X has the desired 
property. 
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Corollary 11.11 Let V be a finite-dimensional vector space over K and 
let g czgl(V) be a Lie algebra of nilpotent endomorphisms of V. 

(a) There exists a basis of V such that the matrices of the endomorphisms 
in g relative to this basis have the form 

"0 *" 

0 

0 0_. 

(b) g is a nilpotent Lie algebra of endomorphisms. 

(c) The associative algebra g* generated by the endomorphisms of g is 
a nilpotent associative algebra; that is, there exists a positive integer r such 
that for any endomorphisms A x ,A r eg* we have A X A 2 * * * A r = 0. 

Proof (a) Let X x e V be such that AX x = 0 for all A eg. If the sub¬ 
space V x = KX x ^ V , then each A eg induces a nilpotent endomorphism A 
on £he nonzero^vector space V= V/V x . Thus we can find X 2 = X 2 + V x ^ 0 
in V such that AX 2 = 0 for all A eg; that is, there exists X 2 e Land X 2 <£ V x 
with 

AX 2 = a 2x (A)Xi + 0X 2 

for all A eg. Continuing by induction we obtain a basis X x ,..., X m of V such 
that for all A eg, 

AX x = 0 and AX„~ 0 mod (X l9 X n - x ) 

where (X x , X n _ x ) denotes the subspace spanned by these vectors. Thus 
the matrix for A has 0’s on and below the diagonal. 

Part (b) follows from (a) and exercise (2), while (c) follows from (a) and 
matrix multiplication. 

Corollary 11.12 Let g be an abstract Lie algebra over K. Then g is a 
nilpotent Lie algebra if and only if for all X e g we have ad g X is a nilpotent 
endomorphism on g. 

Proof If g is nilpotent, then from Theorem 11.7(d) we have ad X is nil- 
potent. Conversely, if each ad X is nilpotent, then by Corollary 11.11, (ad g )* 
is a nilpotent associative algebra. Thus there exists a positive integer p with 
{0} = (ad g) p g — C p g\ that is, g is nilpotent. 

Theorem 11.13 Let G be a connected real Lie group with Lie algebra g. 
Then G is a nilpotent Lie group if and only if g is a nilpotent Lie algebra. 
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Proof First assume that G is nilpotent and let G = 
6 0 > Gi>'">G„ = {e}bea series of closed normal subgroups of G such 
that (G, G k ) <= G k+1 . Consequently we have the corresponding series g = 
ffo [>••• = {0} of ideals of g. Next (6, 6 t ) c implies 

[gg k ]c-g k+1 , for let Xeg, Yeg k . Then for t near OeR we have from 
Theorem 5.16(c) that 

(exp tX, exp tY) = exp(/ 2 [ YY] + o(f 3 )) 

is in G k + 1 . However, from the characterization of the Lie algebra of G f + 1 in 
Theorem 6.9, this implies t 2 [XY] + o(t 3 ) eg k+1 which yields [XY]eg k+1 -, 
that is, [gg k \<^g k +i . 

We now sketch the main parts of the proof of the converse and leave the 
details as exercises. First, since g is a nilpotent Lie algebra, we see that ad g is 
a nilpotent Lie algebra of endomorphisms (with index of nilpotency N). 
Thus for any Z e g, ad Z is nilpotent. Consequently in the expansion of 
Theorem 5.18 

exp X • exp Y = exp F(X, Y) 

for X, Y in g near the origin 0 eg, we have that the Campbell-Hausdorff 
formula 

F(X, Y) = X+ Y + ±[XY] + ■■■ 

is of finite length since (ad X) N — (ad Y) N — 0. 

Secondly, from the chain of ideals 

g c *g k &~g 2 £>••• r >g n - {0}, 

where [gg k ] <=g k+1 , we obtain for the connected subgroup G k generated by 
exp g k the chain 

G t><Ji >G 2 t=>- • • >6, = {e}. 

Finally, for Xeg. Ye g k near enough the origin 0, we have for x = exp X, 
y = exp Y 

xyx-'y-' = exp([AT] + •••) = exp P(X, Y) 

where P(X, Y) is a finite sum of commutators, using the first part of the proof. 
Now each commutator term in P(X, Y) contains Yeg k . However, g k is an 
ideal of g so that [gg k ]<=g k +1 and therefore P(X, Y)eg k+i . Thus 
■xy* -1 J rl eexp0* +1 <= G k + 1 and by induction on the length of products of 
elements G and G k we obtain (G, G k ) a G k + i . 

Exercise (4) Show that each commutator term in P(X, Y) contains 
Yeg k . Also complete the induction. 
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(5) Let g be a finite-dimensional Lie algebra over K. 

(i) If /*! and h 2 are ideals of g which are nilpotent Lie subalgebras, show 
that + h 2 is a nilpotent ideal of g . 

(ii) Show that a maximal nilpotent ideal n of g exists; n is called the 
nilpotent radical of g. See the work of Bourbaki [1960] and Jacobson [1962] 
for more results of this nature. 


3. Nilpotent Lie Algebras of Endomorphisms 

We shall now generalize the process of finding the Jordan canonical form 
matrix of an endomorphism to the process of decomposing a vector space 
into weight spaces relative to a nilpotent Lie algebra of endomorphisms; that 
is, finding simultaneously “Jordan forms” for a nilpotent Lie algebra of 
endomorphisms. Recall from Section 9.2 that a Lie algebra over K is split if 
all the characteristic roots of ad X are in K for all Xeg. 

Theorem 11.14 Let V be a finite-dimensional vector space over K , and 
let g be a split nilpotent Lie subalgebra of gl(V). 

(a) There exists a direct sum decomposition 

where V((p k ) = {Xe V : for all Teg , (T— <j) k (T)I) p X = 0} are ^-invariant 
weight spaces for g for k = 1 ,..., m. 

(b) There exists a basis of V so that the matrices of the endomorphisms 
in g relative to this basis all have the block form 



(c) The functions <f) k :g-*K are linear; that is, (fr k eg *. Furthermore 

4>k(lg,gJ) = { o}* 
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Proof We break the proof into several parts. First we have the following 
formula for an associative algebra A. Let s,te A, and let s (0) = s, s ( ) = 
ts - st = (ad t)s, and s (k> = (ad t) k s. Then we obtain by induction for k = L 

2 ,... 



Next we have the following result. 

Lemma 11.15 Let V be a finite-dimensional vector space over K, and let 
g be a split nilpotent Lie subalgebra of gl(V). Let Teg, A e K and let 

V(X) = {XeV:(T- U)"X = 0 for some n} 
be a weight space for T. Then V(X) is a ^-invariant subspace of V. 

Proof We noted in Section 10.3 that V(X) is a subspace. Since g is a 
nilpotent Lie algebra, the Lie algebra h = g + KI is also nilpotent where / is 
the identity endomorphism. Therefore, by Corollary 11.12, [ad(T— /J)] N == 0 
for some fixed N. Now for Sega End(F) let = [ad(T — A/)]S, S — 
[ad(T- /./)] 2 S, etc. Then for Xe V(l) with (T- U) m X = 0, we see by choos- 
xng lc — N + m and using (*) that 

(T- XI)\SX) = Uf- l X = 0 

noting S (N) = [ad(T- A/)] N S = 0. Thus by definition of K(A) we see 
SXe V(X) ; that is, V(X) is ^-invariant. 

Proof of Theorem 11.14 (continued) To prove part (a), we use in¬ 
duction on the dimension m of V. If m = 1, then every Teg has a characteris¬ 
tic root so that TX - X{T)X for V = KX. This yields the result in this case. 
For m > 1 we let Teg , and by Proposition 10.16 we have the direct sum 
y = + • • • + V(X n ) where the F(A f ) are weight spaces for T. By Lemma 

11.15 the F(/l f ) are ^-invariant, and consequently g restricts to a nilpotent 
Lie algebra of endomorphisms on each F(A e ). Thus we conclude the proof by 
induction, because we can assume Thas at least two distinct characteristic 
roots (why?) so that the dimension of F(A f ) is less than the dimension of F. 
Now since F is finite dimensional, we see that there are only finitely many 
distinct weights of g- 

To construct the basis of F which gives the matrix in (b), it suffices to find 
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a basis for each K(0 ; ) which gives the corresponding block matrix. Thus let 
V(<j)) be a typical weight space as in (a). Then there is a nonzero X e V((j>) 
such that TX = <j)(T)X. To see this, we use Lie’s theorem (Theorem 10.23) 
replacing algebraic closure by “ split ” as follows. Since g is nilpotent on V(<p), 
it is solvable on Thus there is a nonzero X e V(<p) and a characteristic 
function Fso that for all Teg, TX = F{T)X. Therefore 

[F(T) - <j>(T)]X = [T- <KT)I]X, 

and by induction 

[F(T>- <p(T)] k X = [T- <f>(T)I] k X = 0 

for k large enough, remembering X e V(4>). Thus F(T) = 4>(T); that is, 4>{T) 
is the only characteristic root. 

Thus, for X 2 = X as above, the one-dimensional subspace W = KX i is 
^-invariant. Set F = V((j))[W. Then g induces a nilpotent Lie algebra of 
endomorphisms g by TX = TX. From this the characteristic roots of T are 
<f>{T), and V is a weight space of dimension less than the dimension of V. By 
induction we can find a basis X 2 ,..., X m of Fso that 

TX 2 = 4>(T)X 2 

TX 3 = a 23 (7)X 2 + <KT)X 3 

_ m- 1 

TX m = Y a jm (T)X J + 4>(T)X m . 

j-2 

Thus for Xj = X t + W and W = KX x , we can find a basis X 1 . X m of 

V(<j>) so that 

TX , = <t>(T)X 1 

TX 2 = a 12 (T)X 1 + cf>(T)X 2 

m- 1 

TX m = I a Jm (T)Xj + <j>(T)X m . 

j= i 

Thus we have the desired basis for 

For part (c), we show a weight 0 is a linear functional as follows. As in 
(b), let 0 # X e V(<f>) be such that for all Te 9 , TX = 4>(T)X. Then for S, Te g 
we have S H- Teg and 

^(5 + T)X = (S + T)X =SX+TX= + <f>(T)]X 

so that <j)(S 4- T) = <j)(S) + Similarly c/)(aS) = a(j>(S) for as K, and also 
T])X = [S, T\X - (ST )X - ( TS)X = <j>(S)<j>{T)X - <t>{T)<j>{S)X = 0. 
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Thus since the elements of [ g , g] are of the form £ [S, T], this implies 

<Klg, a\) = 0 . 

Exercises (1) State results analogous to those in Theorem 11.14 for 
nilpotent Lie groups. [Recall if/: G -> R* is an analytic character (Proposi¬ 
tion 10.17), then Tf(e) : g -» R is an algebra homomorphism.] 

(2) (i) Let ^ be a nilpotent Lie algebra of derivations of a non- 
associative algebra A over an algebraically closed field. Decompose 
A = A(<j>i) + • • • + A(<f> k ) into weight spaces relative to 3>. Show A(<j>i)A(<l>j) = 
{0} if </> f + <l>j is not a weight, and A(<t>i)A(^>j) c A(<p t + <j)j) if <t>i + <l>j is a 
weight. 

(ii) Let g be a Lie algebra over an algebraically closed field and let h 
be a nilpotent subalgebra of g. Then what can be said about a weight space 
decomposition of g relative to ad A? 

(iii) Let g be a Lie algebra over an algebraically closed field, and let D 
be a nonsingular derivation of g. Show that g is a nilpotent Lie algebra. 



CHAPTER 12 

SEMISIMPLE LIE GROUPS 
AND ALGEBRAS 


We have previously defined a Lie group G to be semisimple in case its Lie 
algebra g is semisimple. Consequently in this chapter we discuss generalities 
on semisimple Lie algebras over a field K of characteristic 0 and in the re¬ 
maining chapters investigate them in more detail. First we consider a non- 
associative algebra A with a certain invariant bilinear form f : A x A K 
and show that if this form is nondegenerate, then A is a direct sum of 
ideals which are simple algebras. We apply this to the case where A is associ¬ 
ative and the form/(X, Y) — trace L(X)L(Y) and discuss the semisimplicity 
of A. 

With the associative algebra as a model, we prove results due to Cartan 
which lead to the fact that a Lie algebra g over K is semisimple if and only if 
the form f(X> 7) = trace ad X ad Y is nondegenerate. As immediate corol¬ 
laries we prove that a nonzero ideal of a semisimple Lie algebra is also semi¬ 
simple and that a derivation of a semisimple Lie algebra is inner. We eventu¬ 
ally use this to show that for a large class of “semisimple” nonassociative 
algebras every derivation is inner. 

Next we come to the very important result that a Lie algebra g is semi¬ 
simple if and only if every ^-module is completely reducible. Using this we 
show that if a Lie algebra g of endomorphisms acts in a completely reducible 
manner on a vector space V, then g = c © g\ where c is the center of g and 
9 ~ 9] is semisimple or {0}. As applications of these results we discuss the 

nilpotent radical of a Lie algebra and the tensor product of completely 
reducible ^-modules. 
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1. Invariant Bilinear Forms 


We shall show in this section how certain bilinear forms can be used to 
decompose nonassociative algebras into a direct sum of ideals which are 
simple algebras. In particular, this will yield for Lie and associative algebras 
that they are semisimple; that is, the algebras contain no proper solvable 
ideals. 

Definitions 12.1 (a) Let g be a Lie algebra over the field K of character¬ 
istic 0 and let p : g -* gl(V) be a finite-dimensional representation of g where 
V is a finite-dimensional vector space over K. Then the map 

r : g x g -*■ K : (X, 7) -*■ trace p(X) ° p(Y) 

is a symmetric bilinear form on g called the trace form for g relative to p. 
In particular, for V = g and p = ad the corresponding trace form is called 
the Killing form and we shall frequently denote this form by Kill(2f, Y). 

(b) Let/ be a bilinear form on the vector space V, let G <= GI,(V) be a 
Lie group, and let g c: gl(V) be a Lie algebra. Then f is called G-invariant 
if for all X , Y e V and A e G we have 

/(AX, AY) = f(X, Y). 

Similarly /is ^-invariant if for all X , Y e V and D eg we have 
f(DX, Y) + f(X, DY) = 0. 

Note Corollary 12.6 for the relationship between the G-invariance and 
the {/-invariance of/ when g is the Lie algebra of G. 

Proposition 12.2 Let g be a Lie algebra and let p be a finite-dimensional 
representation of g in V. Then the trace form r(X, 7) = trace p(X)p(Y) is 
ad(</)-invariant. 

Proof First recall for endomorphisms S and T that trace ST = trace TS. 
So let X, Y, Z e g, then for D = ad(X) we have 

x(DY,Z) = x([XY],Z) 

= trace p([XY))p(Z) 

= trace [p(X), p(Y)]p(Z) 

= trace p(X)p(Y)p(Z) - trace p(Y)p(X)p(Z) 

= trace p(Y)p(Z)p(X) - trace p(Y)p(X)p(Z) 

= trace p(Y)p([ZX]) = -z(Y, DZ). 
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Corollary 12.3 (a) The Killing form of g is ad(/)-invariant; that is, for 
X, Y,Z eg we have 

Kill([XY], Z) = Kill(X, [YZ]). 

(b) The Killing form of g is Der(/)-invariant, where Der(^) is the deri¬ 
vation algebra of g. 

Proof To see (b) note that if D is a derivation of g, then ad (DX) = 
[D, ad X], Thus as in the calculations in Proposition 12.2 we have 

Kill {DX, Y) = trace ad (DX) ad(Y) 

= trace (D ■ ad X - ad X ■ D) ad Y 
= trace ad X(ad YD - D ad Y) 

= -trace ad X ad DY = -Kill(X, DY). 

Definition 12.4 Let A be a nonassociative algebra over a field K. 
A symmetric bilinear form f : A x A -*■ K is called invariant or associative 
if for all X, Y, Z e A we have 

f(XY, Z ) = f(X, YZ). 

Proposition 12.5 Let V be a finite-dimensional vector space over R and 
let A e End(K) and let / be a bilinear form on V. Then the following are 
equivalent. 

(a) For all X, Y e V we have f(AX, Y) + f(X, AY) = 0. 

(b) For all X, Y e V and teR we have /((exp tA)X, (exp tA)Y) = 
f(X, Y). 

Proof If we assume (b), then we obtain (a) by using the product rule for 
differentiation as in Section 1.2. Conversely, assuming (a), we obtain (b) by 
showing <j>(t) = /((exp U)X,(exp tA)Y) and xj/{t) = f(X, Y) are both 
solutions to the differential equation dzjdt = 0 with z(0) = f(X, Y). Clearly 
tj/(t) is a solution. Next using the product rule we obtain 

4>'(t) = f((A exp tA)X,(e\ p tA)Y) + /((exp tA)X,(A exp tA)Y) = 0, 

where we use (a) replacing X [respectively Y] by (exp tA)X [respectively 
(exp tA)Y] to obtain the last equality. Thus by uniqueness of solutions 
4>(t) = iA(0- 

Corollary 12.6 Let G e GL(V) be a connected Lie group of automor¬ 
phisms of V over R and let G have Lie algebra g <= gl(V). Let / be a bilinear 
form on V. Then/is G-invariant if and only if / is ^-invariant. 
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Exercises (1) Let g be a Lie algebra over R and let/be a bilinear form 
defined on g. Show / is invariant under the identity component of Aut(fir) 
if and only if / is invariant under Der(g). 

(2) The Killing form of a Lie algebra g over K is Aut(#)-invariant. 
[Note for/ e Aut(/) that/ »adX» f~ l = ad(/X).] 

Definition 12.7 Let W be a subspace of V and let / be a symmetric or 
skew-symmetric bilinear form on V. Then 

tV 1 = {X e V: for all Y e W, /(X, Y) = 0} 

is the orthogonal complement of W relative to /. Then / is nondegenerate if 
V 1 = {0}. 

From results in algebra [Jacobson, 1953, Vol. II; Lang, 1965] we have the 
following: 

Proposition 12.8 Let/be a nondegenerate symmetric bilinear form on 
V and let IF be a subspace of V such that W n W 1 = {0}; that is, W is 
nonisotropic. Then V = W + W 1 as a subspace direct sum. 


Lemma 12.9 Let J be a finite-dimensional nonassociative algebra 
over K with a symmetric nondegenerate invariant form / : A x A~* K and 
let B be an ideal in A. Then: 

(a) B 1 (relative to/) is an ideal in A. 

(b) B n B l is an ideal with ( B n B 1 ) 2 = {0}. 

Proof For any X e A, Ye B, and Z e B l we use ABcB to obtain 
f(ZX, Y) = f(Z, XY) 6 f(B L , B) = {0} 

so that B l A cz B 1 . Similarly 

f(XZ, Y) = f(Y, XZ) = f(YX, Z) e f(B, B l ) = {0} 

so that AB 1 <= B 1 , which yields B 1 is an ideal of A. 

To show (b) we use B n B 1 is in both B and B 1 to obtain 

f((B n B 1 ) 2 , A) = f{B n B L , {B n B X )A). 

However, B n B 1 <= B 1 and (B n B L )A c BA c B so that using/(B 1 , B) = {0} 
we obtain 

/(B n B\ (B n B L )A) = {0}. 

Thus (B n B 1 ) 2 c A 1 = {0} using/is nondegenerate. 
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Following Jacobson’s proof [1962] we now prove a structure theorem, 
due to Dieudonne, for algebras with a nondegenerate invariant form. Recall 
that a nonassociative algebra is simple if A 2 # {0} and A has no proper ideals. 

Theorem 12.10 Let A be a finite-dimensional nonassociative algebra over 
K with a symmetric nondegenerate invariant form /. Furthermore assume A 
has no ideals B with B 2 = {0}. Then A is a direct sum of ideals which are 
simple subalgebras. 

Proof Since A is finite dimensional we let B be a nonzero minimal ideal 
of A. Then B n B 1 is an ideal of A such that (B n B 1 ) 2 = {0}, using Lemma 
12.9. Thus, using the hypothesis, we have B n B 1 = {0} so that by Proposition 
12.8 we have A = B © B 1 (direct sum of ideals of A) with B 1 an ideal of A and 
furthermore BB L a B n B L = {0}. From this B is a simple algebra for if C is a 
proper ideal of B , then since CB L = B ± C = {0} we see C is an ideal of A. 
This contradicts the minimal choice of B. 

Next we see that the restriction / of / to B 1 x B 1 defines an invariant 
form on B 1 which is also nondegenerate, for if f(B\ X) = 0 with X e jB 1 , 
then since A = B © B 1 we have f(A, X) = {0} so that X = 0. Thus B 1 satisfies 
the same hypothesis as A and by induction on the dimension we can conclude 
B 1 = A 2 © • * * © A m , where the A k are ideals of B L (and therefore of A) 
which are simple subalgebras. Therefore with A x = B we have the desired 
results A = A x © A 2 © * * * © A m . 

We also have the following uniqueness for the above decomposition. 

Proposition 12.11 Let A be a finite-dimensional nonassociative algebra 
such that 

A = A x © • * ■ © A m = B x © • • * © B n , 

where the A t and Bj are ideals of A which are simple subalgebras. Then 
m — n and for each A t there is a B j(i) with A x — B j(i) . 

Proof For each j = 1 consider the ideal A x n Bj . If A t n Bj = { 0 } 
for each j — 1 , ..., n, then we have 

A x B j = A 1 nB J = { 0} 

using both A x and Bj as ideals of A . Thus since A = B x © • • • © B n we obtain 
A X A = {0} so that A 2 = {0} which contradicts the simplicity of A x . Thus for 
some j = y(l) we have the ideal A x n BjW * { 0 } so that 

A x = A x n ^ ( i) = B jiX) 
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using the simplicity of the ideals A , and Z? j(n . Similarly there exists j(2) with 
A 2 = B j(2 ) so that A 1 BA 2 = B m) © B n2) . Thus we can conclude the proof 
using induction by noting every B } is equal to some A i(j y 

Exercise (3) Let g be a finite-dimensional Lie algebra over K with 
g = g t © g 2 © • • • © g n where the g } are ideals of g which are simple sub¬ 
algebras. Show g = [gg] and for every ideal p of g there exists a unique ideal q 
of g so that g = p®q and p nq = {0}. Is a similar result valid for a general 
nonassociative algebra over K’l 

Example (1) Let A be a finite-dimensional associative algebra and 
L{X) ; A -»A : Y X Y be the left multiplication by X. Let 

f(X, Y) = trace L{X)L{Y). 

Then using L{XY)Z = (XY)Z = X(YZ) = L(X)L(Y)Z we see 

f(X, Y ) = trace L(XY). 

Using A is associative, this shows /is an invariant form on A. 

Next if / is nondegenerate, we shall show there are no ideals B of A with 
B 2 = {0}; for suppose B is such an ideal and extend a basis of £ to a basis of A 

as follows: {X . . X r , X r + U ..., XJ is a basis of A where X u ...,X r eB. 

Then for Z 6 B and X e A, L(Z ) and L(X) have the following block matrices, 
respectively, 


[0 z l2 

and 

[/■ h 

L° ° 


l P ^22J 


which uses B as an ideal and ZB = {0}. Thus we obtain 

/(Z, 2Q = trace q 12 ] £*] =°> 

and since we are assuming/is nondegenerate we have Z = 0; that is, B = {0}. 
This proves the following result. 

Proposition 12.12 Let A be a finite-dimensional associative algebra 
over K and let/(X, Y) = tr L{X)L(Y) be nondegenerate. Then A = A t © • • • 
© A n where each A k is an ideal of A which is a simple subalgebra. 

From Chapter 9 recall that a simple associative algebra is isomorphic to a 
complete matrix algebra over some division ring. We now sketch some rel¬ 
ationships between the radical of a finite-dimensional associative algebra A 
and f(X, Y) = tr L(X)L( Y). First, the radical N of A is usually characterized 
as the maximal nilpotent ideal of A. However, using associativity we see N is 
also the maximal solvable ideal of A. 
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Lemma 12.13 Let A e End(F), where V is an /w-dimensional vector 
space over a field K of characteristic 0. Then A is nilpotent if and only if 
trace A k = 0 for k = 1, 2,..., m. 

Proof From results on the characteristic polynomial det(A — xl) 
we have that trace A ~ where k t are the characteristic roots of A. Since 
the characteristic roots of A k are (2,)* we also have trace A k * (2,)*. First, 

if A is nilpotent, then extend A to an endomorphism of V=P® V where P 
i^the algebraic closure of K. Thus since A is nilpotent on V 9 it is nilpotent on 
V so all the characteristic roots are 0 and therefore trace A k = 0 for k = 1 , 2, 
..., w. 

For the converse we use the fact that 

F(x) = det(x/ - A) = x m -p x x m ~ l + • • • + (-1 ) m p m 

where 

Pi = Y, x i 

pi-'LWi 

i>j 


Pn = Vi • • • A m 

are the elementary symmetric polynomials in the characteristic roots A, e P; 
that is, writing F(x) = (x ~ ■ ■ ■ (x — in P[x\. Next use the fact for 

s k = f° r k = m that we have the following relation proved by 

induction [Jacobson, 1953, Vol. I, p. 110]. 

0 = s k -p l 5 k . 1 +P 2 s k -2 + ••• + (-l)*~V*-i^i +(-l)*A- 

Thus using the hypothesis that s k = trace A q = 0 for q = 1 , ..., m we have 
p k = 0 for k = 1, ..., m so that F(x) = x m . Thus A m = 0. 

Proposition 12.14 Let A be a finite-dimensional associative algebra over 
a field K of characteristic 0, let N be the radical of A, and let f(X, Y) = 
trace L(X)L(Y). Then 

N = A 1 = (Z e A : f(Z, X) = 0 for all X e A}. 

Proof Let Z e N. Then for any Ye A we have L(Z)Y e N and 
[Z.(Z)]* +1 T e N k = N • • ■ N, &-times. Thus since N is nilpotent, L(Z) is nil- 
potent. However, for any XeA we have L(Z)L(X) = L{ZX) is nilpotent 
since ZX e N. Therefore 

/(Z, X) = trace L(ZX) = 0 

so that N c: A 1 . 
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Conversely, let Z e A 1 and let T = L(Z) 2 = L{Z 2 ). Then 
trace T = f(Z,Z) = 0. 

Next using associativity we have 

T 2 = L(Z 2 )L(Z 2 ) = L(Z 3 )L(Z) 

and by induction 

T k = L(Z 2k_1 )L(Z). 

Thus since Z e A 1 we obtain trace T k = 0 for k = 1, 2, ... which implies T 
is nilpotent. Thus L(Z) is nilpotent. Therefore if m = dim A, then the ideal 
A 1 is such that each Z e A 1 satisfies Z m+1 = 0 and this implies A 1 is nilpotent 
using the exercise below. Since the radical N is the maximal nilpotent ideal, 
A 1 <=AT. 

Exercise (4) Let C be an ideal of the finite-dimensional associative 
algebra A so that for each ZeC, there exists m with Z m = 0. Then C is 
nilpotent; that is, there exists N so that {0} = C N = C ■ ■ • C, AT-times. 

Corollary 12.15 Let A be a finite-dimensional associative algebra over a 
field K of characteristic 0. If A has no proper nilpotent ideals, then A = A x 
© • • • © A m where each A k is an ideal which is a simple subalgebra. 

Exercise (5) Let A be a finite-dimensional associative algebra over R. 
Show A is semisimple over R if and only if C ® K A is semisimple over C. 
(Possibly compare the degeneracy of the trace form on A with that on 
C® r A.) This result extends to a field K of characteristic 0 and using the 
results in Section 9.5 on complete reducibility show the following: V is a 
completely reducible ^-module over K if and only if P® K V is a completely 
reducible P® K /4-module where P is the algebraic closure of K. 


2. Cartan’s Criteria 


We now prove results for Lie algebras analogous to those on associative 
algebras involving bilinear forms and semisimplicity. 

Theorem 12.16 (Cartan’s criterion for solvability) Let V be a finite¬ 
dimensional vector space over a field K of characteristic 0 and let g be a Lie 
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subalgebra of gl(V). Then g is solvable if and only if trace(-XT) = 0 for all 
X eg and Y e [gg\ 

Recall from Section 10.2 that a Lie algebra is semisimple if it has no 
proper solvable ideals. 

Theorem 12.17 (Cartan’s criterion for semisimplicity) Let g be a 
finite-dimensional Lie algebra over a field of characteristic 0. Then g is semi¬ 
simple if and only if its Killing form Kill(2f, Y ) = trace ad X ad Y is non¬ 
degenerate. 

For the proof of Cartan’s criterion for solvability we use the methods of 
Bourbaki [1960], Serre [1965], and Tits [1965]. These differ from the methods 
of Hausner and Schwartz [1968] and Jacobson [1962] since the latter introduce 
the Cartan subalgebra first to obtain Theorem 12.16. However, in both cases 
a careful examination of certain rational-valued linear functionals is necessary. 
We shall need the following facts on linear algebra [Jacobson, 1953, Vol. II; 
Lang, 1965]. 

Definition 12.18 Let Fbe a finite-dimensional vector space over a field K 
of characteristic 0. Then A e End(F) is called a semisimple endomorphism if 
the associative algebra K[A] generated by A is a semisimple algebra; note 
I=A°eK[A]. 

Remarks (1) We have AeEnd(V) is semisimple if and only if its 
minimum polynomial p(x) e K[x] is a product of distinct prime polynomials. 
For suppose A is semisimple and its minimum polynomial p(x) — Pi(x) k(l) • • • 
p r (x) k(r \ where some k(i) > 1. Then the element Z = p x (A) ■ • p r (A) is nilpotent 
since Z q — p x (A) q ■ * 'p r (A) q = 0 for large enough q. Since K[A] consists of 
polynomials in A and is commutative, we see that K[A ] • Z is a nilpotent ideal 
in K[A]. This contradicts the semisimplicity of K[A]. Conversely, suppose the 
p k (x) are distinct and each k(i) = 1. Assume K[A] is not semisimple so that 
some Z — f(A ) is nilpotent. Then 0 — Z k — f(A) k implies that the polynomial 
f(x) k is divisible by p(x). Thus since all the k{i) = 1 ,/(*) is divisible by p(x); 
that is,/(X) = g(x)p(x) so that Z = f(A) = g(A)p(A) = 0. 

(2) Using the result that an associative algebra B with B 2 ^{0} is 
semisimple if and only if every 5-module is completely reducible (Section 9.5) 
we see that the endomorphism A is semisimple if and only if for any K[A ]- 
invariant subspace W of F, there exists a K[T]-invariant subspace W' so that 
v = w + W' with WnW' = {0}. 

(3) If K is algebraically closed, we can compute characteristic roots and 
vectors to obtain A as a semisimple endomorphism if there exists a basis 
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X u ..., X m of V so that AXi = k t Xi for some e K ; that is, A is diagonal- 
izable. [This uses remark (1).] 

Using Jordan canonical forms we have the following result: 

Proposition 12.19 Let V be a finite-dimensional vector space over the 
algebraically closed field K of characteristic 0 and let A e End(L). Then 
there exists a semisimple S e End(F) and a nilpotent N e End(L) such that 

A = S + N and SN = NS, 

Furthermore, S and N are uniquely determined by these conditions. There 
exist polynomials ^(x), n(x) e K[x] without constant terms such that S = 5(^4) 
and N = n(A ). Then S (respectively N) is called the semisimple (respectively 

nilpotent) component of A. 

Proof Let F(x) = det(x/ - A) = (x - k x ) kx • * *(x - A r )* r be the factor¬ 
ization of the characteristic polynomial where the A 1} ..., k r are distinct. Then 
following the proof of Jacobson [1953, Vol. II, p. 130] the 

Hi(x) = F(x)j(x - Xi) ki 

for i = 1, ..., r are relative prime. Thus there exist 4> k (x) e K[x] with 

r 

Z = 1- 

i — 1 

Substituting A into this expression we obtain 

I = / 

which gives the direct sum decomposition 

V=V 1 + --- + V r , 

where the Vj are the ^-invariant subspaces 

Vj = 4>j(A)hj(a)v 

= {X e V: (A — AjI f’X = 0}. 

Thus define S by SXj = kjXj for Xj e V } and set N= A-S. Then S is 
semisimple and N is nilpotent and furthermore, by construction, [A, S] = 0 
so that NS = SN. 

A straightforward computation shows that for the polynomial s(x) = 
Z Ai <t>i(x)n j(x> we have S = s(A) and for n(x) = x - s{x) we have N = «(/)). 
We can furthermore assume .v(0) = 0 as follows. If A is invertible, then the 
characteristic polynomial F(x) has a constant term so that I is a polynomial in 
A without constant term. Thus if I appears in the expression S = .?(/!). make 
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the above substitution to obtain another polynomial expression for S in 
terms of A without constant terms. In this case we also see «(0) = 0. If A is not 
invertible, then ker A ^ {0} is invariant under N and 7V|ker(/l) is also nil- 
potent. Thus there is a nonzero X e ker(^) such that NX = 0. Thus since 
N = n{A) and AX = 0, we see that the constant term of «(jc) is 0. Thus 
«(0) = 0 so that 5(0) = 0 from the construction «(jc) — x — 5 (x). 

Exercise (1) The representation A = S + N with SN = NS is unique; 

that is, if A = S + N = S' + N\ where S, S' are semisimple and N, N' 

nilpotent and [S, N] = [S', N'] = 0, then S = S' and N = N'. 

Example (1) Let Aegl(V), where V is finite-dimensional over an 

algebraically closed field K , and let A = S 4- TV be the decomposition into 
semisimple and nilpotent components. Then ad S and ad N are the semi¬ 
simple and nilpotent components of ad A. Since [S, N] = 0 we see 0 = 
ad[S, TV] = [ad S, ad N] so that we must show ad N is nilpotent and ad S 
is semisimple. 

First for any X e gl(V ) we see 

(ad N)X — NX - XN — (L(N) - R(N))X , 

where L(N) and R(N) are left and right multiplications in End(F) and 
[L(N ), ^(A^)] = 0 using the associative law. Thus using the binomial expansion 
we obtain as before 

(ad N) k X = (L(N) - R(N)) k X 

= i(-iyC k , i N k -‘XN i . 

i = 0 

Thus if N q = 0, ad iV is nilpotent. Next since S is semisimple, there exists a 
basis X u ...,X m of V so that SX i = A i X i with X^K. For the basis 
{Eij : i 9 j = 1, ..., m} of gl(V ) defined by E u X k = S Jk X { we obtain the usual 
matrices and consequently the matrix computations give 

(ad S)E ij = (X i ~X j )E ij . 

Thus ad S is diagonalizable on gl(V); that is, ad S is semisimple. 

Lemma 12.20 Let V be a finite-dimensional vector space over an alge¬ 
braically closed field K of characteristic 0. Let T and W be subspaces of gl(V) 
with W^T and let P = {A egl(V ) : [A, W] a T}, Let A eP be such that 

trace AZ = 0 


for all Z eP. Then A is nilpotent. 
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Proof Let A = S + N be the decomposition of A into its semisimple 
and nilpotent components and let X u ..., X m be a basis of V so that SX t = 
X t X t with A f e K and N is given by a nilpotent matrix. Now let L be the vector 
space spanned by the A/s over the rational field Q. Thus L cz K and to show A 
is nilpotent we shall show L = {0} by proving/(L) = {0} for every Q -linear 
functional/ eL*. 

Thus let X u ..., X m be the above basis of V and let U e End(F) be the 
semisimple endomorphism defined by 

UX k = f(X k )X k9 

where / is any given linear functional in L*. Let {E tj : /, / = 1, ..., m) be the 
basis of gl(V) defined by the basis X u ..., X m as in the preceding example. 
Then from this example we can deduce 

(ad S)%j = (A f - Xj)%j for A: =0,1,2,... 

and 

(ad u)E tJ = (/ad - /a J ))E ij = /a, - Xj)e,j, 

using U as semisimple and / as linear. 

Next by the interpolation formula we can find a polynomial p(x ) e 
which goes through the finitely many points 0, /(A f — Ay). Thus p(x) has 0 
constant term and p(X t - Ay) = f(X { - Ay). Using this and the above formula 
for (ad S) k E tJ = (A f - Xj'fEy we see 

(ad U)E ij = f(Xi - Ay)£ u 

= p(Xi — Ay)£/y = p(p. d S)E t j 

so that ad U = p (ad S). From the example, ad S = s(ad A ), where s(x) is a 
polynomial without constant term, and since (ad A)IV cz T we see (ad U)W = 
[{p o s )(ad ^)]1F c J. Thus by the definition of P, we have U € P. 

Since U e P we have by hypothesis and the construction of U , *S, and iV 
relative to the same basis X j,..., X m of F, that 

0 - trace AU = trace (S + N)U — trace SU = £ A,/^). 

Therefore since /(A f ) e Q and / is a g-linear functional, we apply / to the 
above equation to obtain 

0 = I /( A ,.) 2 

which implies /(A f ) = 0. Thus /(L) = {0} and since / is arbitrary in L* we 
have L — {0}, so that is nilpotent. 

We now use this to prove Cartan’s criterion for solvability: If g is a sub¬ 
algebra of gl{V) where V is finite-dimensional over a field K of characteristic 0, 
then g is solvable if and only if trace XY = 0 for all X eg and Y e [g, g]. 
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First assume the field K is algebraically closed and suppose trace X Y = 0 
for all X eg and Y e [g, g]. Then we shall apply Lemma 12.20 with W = g 
and T = [g, g] to obtain 

Iff, g]<=-P = {Z e gl(V ): [Z, g] c [g, g]}. 

Thus for any X, Y eg and Z eP we have 


trace [X, Y]Z = trace X[Y, Z] = 0, 

where we use trace XA = 0 for all X eg and A e [g,g] for the second equality. 
Thus by the linearity of “trace” on gl(V) we have trace AZ = 0 for all 
A e [g,g] and Z eP. Therefore by Lemma 12.20, every A e [g,g] is a nilpotent 
endomorphism. Thus by Corollary 11.12 to Engel’s theorem [g, g] is a 
nilpotent Lie algebra. However since g/[g, g] is Abelian and [g. g] solvable, 
we have by Corollary 10.8 that g is solvable. 

Conversely, if g is solvable, then by Lie’s theorem (Proposition 10.23) 
we can find a basis which simultaneously puts the matrices for [g, g] in 
triangular form with 0’s on the diagonal. Thus for X e g and Y e | g, g]. 


«n 


* 


ro 


trace XY = trace 


= 0 


0 


0 


which proves the result if K is algebraically closed. 

Next if the field K is not algebraically closed, let P be its algebraic closure 
and let g = P ® K g . If g is solvable, then from Section 10.2, g is solvable. 
Thus trace X Y = 0 for all X e g and Y e [g , g] implies trace X Y = 0 for all 
X eg and Y e [g, g\ 

Conversely, if trace 0 for all X eg and Y e [g , g\ then for any 
X = Y J w i X i egand Y=J^u pq [X p , X q ] e [g,g], where w t ,u pq eP and X u .... 
X m is a basis of g , we obtain 


trace XY~J] w i u pq trace Xil x P > X q ] = 0. 
Thus g is solvable and since g a g we have that g is solvable. 


Exercise (2) Using the fact that for A, B e End(F), 

2 trace AB = trac e(AB + BA) 

— traced + B) 2 — trace A 2 — trace B 2 
show the following result: 

Corollary 12.21 If g is a Lie algebra over a field of characteristic 0, then 
g is solvable if and only if trace(ad X) 2 = 0 for all X e \gg\ 
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We can now prove Cartan’s criterion for semisimplicity: If g is a Lie 
algebra over a field of characteristic 0 , then g is semisimple if and only if the 
Killing form of g is nondegenerate. 

First assume g is semisimple and let h= g L = {Z eg : Kill(X, Z) = 0 for 
all Xeg}> Then h is an ideal of g. Furthermore we have for each Z eh 
and Y e [gg\ that 

trace ad Z ad Y = Kill(Z, Y) = 0. 

Therefore by Cartan’s criterion for solvability we see ad h is a solvable sub¬ 
algebra of gl(g). However, ad h is isomorphic to hjZ where Z = ker ad is 
commutative. Thus since h/Z is solvable (since ad h is solvable) and Z is 
solvable, h is a solvable ideal. Therefore h c= rad g = {0} which shows the 
Killing form is nondegenerate. 

Conversely, assume that the Killing form is nondegenerate. Then g has no 
proper ideals h with [hh\ — { 0 }; for assume h is such an ideal and extend a basis 
of h to a basis of g. Thus as in example (1) in Section 12,1 for an associative 
algebra, we have for Z eh and X eg that 


adZ = 

0 z 12 

and 

adX = 

*n * 12 ' 

V 

0 0 



0 X 22 


so that Kill(Z, X) = 0; that is, h = {0}. However, if r = rad g #= 0, then in the 

series r >r (2) -->r (n_n >r (n) = {0} we see r (n_1) is a nonzero ideal of g 

which satisfies [r ( "“ 1 V ( "" 1) ] = 0 (This uses that if h and k are ideals of a Lie 
algebra g, then the product [Me] is an ideal of g). Thus this contradiction 
shows r = { 0 }. 

Corollary 12.22 Let g be a semisimple Lie algebra over a field K of 
characteristic 0 and let p be a faithful (injective) representation of g in a 
finite-dimensional vector space V over K. Then the bilinear form t(X, Y) = 
trace p(X)p(Y) is nondegenerate. 

Proof Analogous to the preceding proof one sees that h = {Z eg : 
t(Z, Y) = 0 for all Y eg} is an ideal of g such that p(h) is solvable in gl(V). 
However, since ker p = {0} we have h is solvable. Thus h = {0} so that t is 
nondegenerate. 

Corollary 12.23 Let g be a finite-dimensional Lie algebra over a field K 
of characteristic 0. Then: 

(a) g is semisimple if and only if g = g x © • • • © g n where each g k is an 
ideal of g which is a simple subalgebra; 

(b) g is semisimple if and only if g has no ideals h with [hh] = { 0 }. 
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Proof This follows from the fact that g is semisimple if and only if 
Kill(2f, Y ) is nondegenerate, and Theorem 12.10 which then expresses g as a 
direct sum of simple subalgebras which are ideals. 

Remarks (4) We shall now outline the proof that if g is a simple Lie 
algebra of endomorphisms over an algebraically closed field K of characteristic 
0, then for all X, Y eg 

KillCY, Y) - X trace XY 

for some X e K. 

Exercises (3) Let /be any nondegenerate invariant form on the simple 
Lie algebra g. Show that there exists S e GL(g) so that for all X, Y eg 

f{X , Y) = Kill (SX, Y) and Kill (SX 9 Y) - Kill(X, SY). 

(4) Using the invariance and nondegeneracy of f and Kill, show [ad X, S] 
— 0 for all X eg. Since g is simple, conclude by Schur’s lemma [Proposition 
9.14(a)] that S = XL 

(5) (i) Let f(X, Y) — trace XY and show/is an invariant form. 

(ii) Use the fact that g is simple to show that g x (relative to /) is g or 

{ 0 }. 

(iii) Use Lemma 12.20 and (ii) to show that / is nondegenerate and 
conclude the desired formula. 

(iv) What can be said about an invariant form of a semisimple Lie 
algebra? 

(6) Let g be the Lie algebra of n x n matrices of trace 0 over the field K 
as above. 

(i) Show g is simple as follows. With the usual matrix basis E tj note that 
the E XJ for i *£j and H k = E kk - E nn for k < n — 1 form a basis of g. Relative 
to this basis of g find the multiplication table. Assume h is a nonzero proper 
ideal and let Z = Y a i H x + Y/u Eij e h- Carefully grind out products 
[ H { , [Hj , Z]] and compare with Z to eventually show h = g. A better method 
will be given later. 

(ii) Show Kill(AT, Y) * 2 n trace XY for g as in (i). 

(7) Let g be a semisimple Lie algebra and let h be a semisimple sub¬ 
algebra. 

(i) Show that p : h-> ad(#): X -»■ ad X is a faithful representation of h 
in g where ad is the adjoint in g . Thus g is an ^-module. 

(ii) Show that g has the direct sum decomposition g = m + h where 
m ~ h 1 , the orthogonal complement of h relative to the Killing form; note 
Corollary 12.22. 
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(iii) For the decomposition in (ii) define an anticommutative multi¬ 
plication XY on m by XY = [XT] m which is the projection of [XY] in g 
into m . Denote this algebra by (m, XY). For X, Y e m let/(X, Y) = Kill(X, Y) 
and show /is a nondegenerate invariant form on (m, XY). Sagle and Winter 
[1967] show that if g is simple, h is semisimple, and the multiplication 
mm # {0}, then (ra, X Y) is a simple algebra. 


3. Ideals and Derivations of Semisimple 
Lie Algebras 

If h is an ideal of semisimple Lie algebra g over a field of characteristic 0, 
then we shall show that h itself is a semisimple Lie algebra. In particular, 
this will yield the proposition that all the derivations of g map h into h and 
also that Dcr(g) = ad(/); that is, the derivations of g are inner. 

Theorem 12.24 Let h be an ideal of a Lie algebra g over a field of char¬ 
acteristic 0 and let Kill (respectively C) denote the Killing form of g (respect¬ 
ively h). 

(a) If X, 7 e h, then C(X, Y) = Kill(X, Y). 

(b) If g is semisimple and A e h is such that C(A, Y) = 0 for all Y eh, 
then A = 0. 

(c) If g is semisimple, then h is a semisimple Lie algebra. 

Proof (a) Since h is an ideal of g we can decompose the vector space 
g — h + b (subspace direct sum) where [hb] c= h and [hh] c: h. Thus choosing 
a basis from this decomposition we obtain for any X eh, 

o' 2 ] 

where X n is a matrix for ad h X. Thus for X, Y eh we have 

KillHr, y) - trace [*“ £”][ o y '‘ o y “] 

= trace X u Y n 

= trace ad* X ad* Y = C(X, 7). 

(b) Let A eh be such that C(A, Y) = 0 for all Yeh and let Weg. 
Then we use [XW] eh for any X eh to obtain 

Kill([,4X]j W) = Kill(/t, [XJ7]) = C(A, [XW]) = 0 
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using part (a) for the second equality. However since we are assuming g to 
be semisimple, we have by Cartan’s criterion for semisimplicity (Theorem 
12.17)that 1 AX] = 0 for all X eh. Thus using g = h + h we see 


and since 


we obtain 



Thus A ~ 0 using the nondegeneracy of the Killing form of g. 
(c) Use part (b) and Cartan’s criterion for semisimplicity. 


Exercise (1) Let g be a semisimple Lie algebra over a field of character¬ 
istic 0 and let g — g{ 1) © * ■ * © g(ri) be its decomposition into simple ideals. 
If h is an ideal of g , show h = g(i x ) © * * * © g(i k ) for suitable i u ..., i k . In 
particular, this shows [hh] = h. 


Corollary 12.25 Let h be an ideal of a semisimple Lie algebra g over a 
field of characteristic 0. 

(a) If gjh ^ {0}, then gjh is a semisimple Lie algebra. 

(b) If D is a derivation of g , then Dh cz h. 

Proof Since gjh is not {0} we use g = g{\) © • • • © g(n) and h = g(i x ) 
© * * * © g{i k ) to see that gjh is isomorphic to the semisimple Lie algebra g(j\) 
© • • * © g(j s ) where j t ${i i9 /*}. To prove (b) we use h = [hh] and 
D[XY] = [DXY] + [XDY] to obtain 

Dh = D[hh\ cz [Dhh] + [hDh] cz h 
since h is an ideal of g. 

Theorem 12.26 If g is a semisimple Lie algebra over a field K of char¬ 
acteristic 0, then Der(#) — ad(#). 

Proof We follow the proof given by Jacobson [1962] where just the 
nondegeneracy of the Killing form is used. Thus let D e Der(^) and let 

(p : g -> K : X trace(ad X ° D). 
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Then 0 is a linear functional on g and since the Killing form of g is non¬ 
degenerate, there exists A e g such that for all X e g 

<f>(X) = Kill(X, A\ 

(See the works of Lang [1965] and Jacobson [1953, Vol. II] for the rep¬ 
resentation of linear functionals in terms of a nondegenerate bilinear form.) 

Let £ — D — ad A. Then £ is a derivation of g and for any X eg we have 

trace(ad X ° E) = trace(ad X ° D) - trace ad X o ad A 
= 0(20 - Kill(X, A) = 0. 

Using this we have for any X, Y eg that 

Kill(£X\ Y) = trace ad (EX) ad Y 
= trace[£, ad X] ad Y 
= trace(£ ad X ad Y - ad X E ad Y) 

= trace(£ ad X ad Y - E ad Y ad X) 

= trace £[ad 2f, ad 7] 

- trace E ad [XY] = 0 

where the second equality uses ad (EX) = [£, ad X] for any derivation £ 
and the fourth equality uses trace PQ = trace QP for endomorphisms P 
and Q. Thus by the nondegeneracy of the Killing form we have EX = 0 so 
that D = ad A is inner. 

Exercise (2) According to Definition 7.7 what can be said about the 
automorphisms of a semisimple Lie algebra over R ? 


4. Complete Reductibility and Semisimplicity 

In this section we shall discuss results for Lie algebras analogous to those 
for associative algebras concerning semisimplicity and complete reducibility. 
In particular, we shall prove WeyTs theorem which states that a Lie algebra 
g is semisimple if and only if every ^-module is completely reducible. Then 
we apply this to find the structure of a Lie algebra of endomorphisms which 
acts in a completely reducible manner on a vector space. As before, let K 
denote a field of characteristic 0. 

Recall the following from Section 9.5. Let V be a finite-dimensional g- 
module and let p be the corresponding representation. Then V or p is called 
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irreducible or simple if the ^-module V has no proper ^-submodules. Also 
V or p is called completely reducible or semisimple if the ^-module V is a 
direct sum of irreducible ^-submodules. 

As for associative algebras, V is a completely reducible ^-module if and 
only if for every ^-submodule W of V, there exists a ^-submodule W so 
that V = W + W' (subspace direct sum). For example, if g is a semisimple 
Lie algebra, then g is a completely reducible ^-module. We now follow 
Bourbaki’s proof [1960] of Weyl’s theorem. 

Lemma 12.27 Let g be a Lie algebra over the field K, let h be an ideal of g, 
and let p be a representation of g in the finite-dimensional vector space V. 
Assume that the bilinear form 

t(X, Y ) = trace p(X)p(Y ) for X,Y eg 

is nondegenerate when restricted to h x h. Let {X t , X„} be a basis of h 
and let {X t ', ..., X n '} ch be a dual basis; that is, x(X t , X/) = d u . 

(a) The element 

C = t P(Xt) ° piX-) 

i= 1 

is an endomorphism of V which commutes with every endomorphism p(A) 
for A eg. We call C the Casimir operator (of h ) corresponding to p. 

(b) If V is an irreducible ^-module, then the Casimir operator of g is an 
automorphism of V. 

(c) trace (C) ~ n(= dim h) 9 where C is the Casimir operator of h. 

Proof A straightforward calculation shows that the bilinear form r is 
invariant. Thus for A, X, Y eg we have t([AX], Y) + r(X, [AY]) = 0. 
Now for A eg and X t , X k ' eh as above set 

[AX,] = X a Jt Xj and [AX k '\ = a' pk X p ’ 

J P 

using [ gh ] c h. Then we obtain 

a jt = <[AX t l X/) = t(X i , [AX/]) = -a' u . 

Now for any A eg we have 

lp(A), C] = [p(A),l t p(X i )p(Xi)] 

= I p(X t )]p(XD + p{X t )[p{A), p(X ; ')]) 

i 

= Z (pdAX^piX.) + p(XM[AX /])) 

i 

= E a jip(X J )p(X-) + a’j.piXMX/) = 0 
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using aji— —a\j, Thus the endomorphism C commutes with every endo¬ 
morphism p(A) for all A eg. Next we have 

trace(C) = £ trace p{X^p(X{) 

i 

= 'Er(X i ,X i ') = n 

i 

using x(X t , X/) = 6^ and n = dimh. Thus C ^ 0. In particular, for the 
Casimir operator C of g we have from [C, p(g)\ == {0} that ker C is p(p)- 
invariant. Thus by Schur’s lemma [Proposition 9.14(a)], C is an automorphism 
if V is p-irreducible. 

Exercises (1) Show the Casimir operator is independent of the choice of 
basis. 

(2) The following result on linear algebra is used in the next proof. 
Let IK be a subspace of the vector space V and let T e End(K) be such that 
T\W-+W and T\W = XI, where I is the identity endomorphism on W. 
Then T \ W is called a homothetic endomorphism (relative to W). 

(i) Show that M = {T e End(F): T(V) c W and T \ W is homothetic} 
is a subspace which contains the subspace AT = {T e M : T( W) = {0}}. 

(ii) Decompose V = W + P into a subspace direct sum and let T e M 
be as in (i). Relative to a basis chosen from this decomposition show T has a 
matrix of the form 

[f o 2 ] 

for X e K. Thus show that T = XE + T\ where E 2 = E and T' e N is as in (i). 

(iii) Use(ii) to show N is of codimension one in M; that is, dim M/N = 1. 
Also show we can choose EV = W and P — ker E in (ii). Thus V = W + ker E 
is a direct sum. 

Lemma 12.28 Let g be a Lie algebra over the field K. Then the following 
are equivalent. 

(a) Every representation of g in a finite-dimensional vector space is 
completely reducible. 

(b) If p is a representation of g in a finite-dimensional vector space M 
and if N is a subspace of M of codimension 1 so that for every X eg we have 
p(X)M c: N, then there exists a direct sum complement of iV in M which is 
annihilated by p(g ). 

Proof First assume (a). Then for the subspace N of (b) we have 
p(g)N c N so that N is actually a p-submodule. By the complete reducibility 
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there exists a submodule N' with M = N + N' (direct sum). Thus p(g)N' c N' 
but also by hypothesis p(g)N' a N. Consequently p(g)N' c N n N' = {0} 
which proves (b). 

Next assume (b) and let a be a representation of g in the finite-dimensional 
vector space V; that is, a{g) is a Lie subalgebra of gl(V). Let W be a(g)- 
submodule of V. Then we shall show IT has a variant direct complement 

in V. Let v denote the adjoint representation of gl(V) and define p by 

p = v ° a : g -* ad (gl{V))\ 

that is, for X e g we have p(X) = ad(er(20) so that for T e gl(V) we have 
p(X)T = [a(X), T], Thus p is a representation of g in the space gl(V). Let 

M = {T e gl(V) : T(V) a W and T\Wh homothetic}, 

N = {T e M : T(W) = 0}. 

Then from the preceding exercise we have iV is a subspace of M such that 
dim M/N = 1 and also p(g)M a N as follows. Let X eg, T e M, and AeW. 
Then using W is (j(^)-invariant 

(p(X)T)A = KX), T]A 

= o(X)(TA) - Ta(X)A 
= a(X)(U) - a(<7(X)A) = 0 

so that p(X)T e N. 

Now from our asumption (b) we can find a direct sum complement N' 
of N in M which is annihilated by p(g). Since M = N + N' and dim N’ = 
dim M/N = 1 we can assume N' = KE where E is the endomorphism of the 
preceding exercise. Thus since p(g)N’ = {0} we have for all X eg that 

0 = p(X)E= KI), E] 

and therefore ker E is <r(#)-invariant. Thus from exercise (2iii) V = W + ker E 
is a ff(#)-mvariant direct sum decomposition so that a is completely reducible. 

Lemma 12.29 Let g be a semisimple Lie algebra, let p be a representation 
of g in a finite-dimensional vector space M, and let ;V be a subspace of co¬ 
dimension 1 so that for every X eg we have p(X)M c N. Then there exists a 
direct sum complement of N which is annihilated by p(g). 

Proof For X eg let a(X) = p(X) | N. Then we have the following two 
cases. 

Irreducible case Assume a is simple; that is, N is irreducible. If a = 0, 
then for X, Y eg we have using the hypothesis 

p{X)p(Y)M e p(X)N = <t(X)N = {0} 
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so that p(X)p(Y) — 0. Thus since g is semisimple, g = [gg] and therefore 
p(g) = p([gg]) = [p{g), p(g)] = {0}. Consequently a = 0 implies p = 0. 

If 0, let k = ker <r. Then k is an ideal of g and since g is semisimple, 
there exists an ideal h of g so that we have the direct sum of ideals 

g - h © k. 

Now h / {0} otherwise g = k so that <7 = 0 and also p \ h is injective. To see the 
latter let Xeh and suppose p(X) = 0. Then a(X) = p(X)\N = 0. Thus 
X e ker a — k and therefore X e h n k = {0}. Using this we have from 
Corollary 12.22 that the bilinear form 

x(X 9 Y ) = trace p(X)p(Y) 

for X, Y e h is nondegenerate and from Lemma 12.27 we obtain the Casimir 
operator C e End(M). From the formula 

c-Zp&MXt) 

i 

for X i9 X{ 6 h we have, using the Hypothesis p{g)M c N, that 

C(M) c= N. 

Therefore by extending a basis of N to a basis of M we obtain 
trace(C | N) = trace C = dim h ^ 0 

so that C | iV 7 ^ 0. From this and our assumption that N is irreducible we 
obtain C | N as an automorphism. Thus if we let P = ker C, we obtain 

PnN = { 0 }, 

but from C(M) c: JV we obtain N = Image(C) and the direct sum 

M = TV + P. 

However, from Lemma 12.27 we have C] = 0 for all A e g and therefore 
P is p(^)-invariant. This with the hypothesis gives p{g)P a P n N = {0}; that 
is, P is a direct complement of TV annihilated by p(g). 

General case We do this case by induction on the dimension of M. 
In the <r(#)-module TV let S be a nonzero minimal cr(#)-submodule; that is, 
S cz N and S is an irreducible cr(#)-submodule of M. If S = N, then we are in 
the preceding case; otherwise let 

M' = M/S and N' = JV/S. 

Now for every e g the mapping p(X) induces an endomorphism p'(X) : 
M' -* M f and the mapping 

p':g^gl(M'):X^p'(X) 
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is a representation of g in the space M'. By the hypothesis we have 

and by the isomorphism (M/S)/(N/S) £ M/N we have N' is of codimension 
1 in AT. Thus by induction we can assume there exists a subspace P* c= M' of 
dimension 1 such that 

M' = N' + P (direct sum) and p'(g)P' = {0}. 

Now let P c M be the inverse image of P relative to the homomorphism 
M -> M'. Then since P => S and 

1 = dim P = dim P/S and p(g)P <= S 

we can now apply the first part of this proof to the irreducible module S and 
its ambient module P . Thus there exists a subspace gcP so that we have the 
direct sum 

P = Q + S and p(p)Q = {0}. 

From the direct sum M' = N' + P we have (O') = N' n P' so that 

S = JVnPc: jV. 

Also from M' = N' + P' we have for any A e M that there exist B e N, 
C eP, and DeS so that 

^=P + C + i) = C*fP 

for some B f e N (using S <= A). However, from the decomposition P = Q + S 
we can write C = F + H for F e Q and H e S N. Thus we obtain ^4 = F + 
iT where Pe0 and B" e N; that is 

M = Q + N* 

This sum is direct: if ZeQnNczPnN=S, then Z e Q and Z e S; that 
is, Z g Q n 5 = {0} using the direct decomposition P = Q -f S. Thus we have 
the desired direct sum 

M = 0 + N with p(g)Q = {0}. 

Theorem 12.30 (Weyl’s theorem) Let g be a finite-dimensional Lie 
algebra over the field K of characteristic 0. Then g is semisimple if and only if 
every ^-module is completely reducible. 

Proof Combining Lemmas 12.28 and 12.29 we find that semisimplicity 
implies complete reducibility of modules. For the converse we note the hypo¬ 
thesis shows that the adjoint representation is completely reducible. Thus every 
ideal h of g has a complementary ideal k such that g = h © k and the map 

g/k ~ h 
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is a representation of g. Now if g is not semisimple, then from Section 10.2, g 
has an Abelian ideal h. Writing the direct sum h — KX t + * • • + KX n for a 
basis X u ..., X n of h we have that the map \j/ 

ij/ : h-> KX t 

is a representation of h (using [hh\ = {0}). Thus 

\j/ o (j): g -+K 

is a representation of g. However, the one-dimensional algebra K has a 
representation 

p:K^gl(2,K):x^^ x J] 

which is not completely reducible. Thus p ° ij/ © ^ is not a completely reducible 
representation of g. This contradiction shows that no Abelian ideals exist in 
g; that is, g is semisimple. 

Exercises (3) Show the above representation p is not completely 
reducible. 

(4) Let h be a semisimple Lie subalgebra of the Lie algebra g. Show there 
exists a subspace m of g such that g = m + h (direct sum) and [hm\ c= m; 
that is, ( g , h) is a reductive pair. Note Section 12.2, exercise (7). 

Remarks (1) If g is a semisimple Lie algebra of endomorphisms acting 
on the finite-dimensional vector space V over a field K , then V is completely 
reducible relative to g. Consequently the associative algebra g* generated by 
the endomorphism in g is a semisimple associative algebra (Proposition 9.15). 
Thus results on representations of semisimple associative algebras can be 
applied to representations of semisimple Lie algebras. For example, let g be a 
Lie subalgebra of gl{V). Then from complete reducibility we see g is com¬ 
pletely reducible in V if and only if g* is completely reducible in V. However, 
using exercise (5) of Section 12.1, g* is completely reducible in V if and only if 
P ®k9* is completely reducible in P ® K V. Combining these we obtain the 
following which allows one to assume algebraic closure of the field when 
discussing complete reducibility. 

Proposition 12.31 Let g be a Lie subalgebra of gl(V) where Fis a vector 
space over K and let P be the algebraic closure of K. Then g is completely 
reducible in V if and only if P ® K g is completely reducible in P ® K V. 

We now consider the explicit form of a completely reducible Lie algebra of 
endomorphisms. 
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Theorem 12.32 Let V be a nonzero finite-dimensional vector space over 
a field K of characteristic 0 and let g be a Lie subalgebra of gt(V). Then g is 
completely reducible in V if and only if 

(a) g — c ® g' (direct sum), where c is the center of g and g' — [ g , g] is 
semisimple or {0}; 

(b) all the endomorphisms of c are semisimple. 

We divide the proof into several lemmas. 

Lemma 12.33 Let V be a nonzero finite-dimensional vector space over 
the field K and let g c gl(V ) act irreducibly in V. Let h be an ideal of g so that 
every endomorphism of h is nilpotent. Then h = 0. 

Proof Using AX - XA e h for al \Aeg and X eh we see that the sub¬ 
space hV is a ^-submodule of V. By the irreducibility of V we have hV = {0} 
or V. If hV = V , then by induction 

V= hV=h 2 V="' = h k V="'. 

However by Corollary 11.11 to Engel’s theorem we have h n = {0} for suitable 
n. Thus we obtain the contradiction V = {0}. Therefore {0}. 

Lemma 12.34 Let V be a finite-dimensional vector space over the field 
K and let T e gl(V ) be of the form 

T = £[A k ,B k ] 

k= 1 

where A k , B k e gl(V) and [T, A k ] = 0 for k = 1, ..., r. Then T is nilpotent. 

Proof From |T, A k ] = 0 we have [T p 9 A k ] = 0 for p = 0, 1, 2, ... and 
therefore for p = 0, 1,2... 

T" +i = YT p (A k B k -B k A k ) 

k 

= = I , r'jy. 

ft k 

Thus since the trace of any commutator is 0 we have trace T p+i =0 for 
p — 0, 1,_By Lemma 12.13 we obtain T as nilpotent. 

Lemma 12.35 Let g be a Lie algebra over the field K , let g f = [gg], and 
let c be the center of g. Assume 

(a) c n g' = {0} and 

(b) all Abelian ideals of g are contained in c. 

Then g = c © g’ (direct sum) where g ' is semisimple or {0}. 
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Proof Assume g' is not {0} and using (a) let p = c © g' and let q be a 
complementary subspace of p in g. Thus we have the direct sums of subspaces 

g=p+q = c+g'+q 

and we let m = g' + q. Therefore m n c = { 0} and since m => g' we have 
[gm\ c [gg] c/nso that m is an ideal of g. Therefore g = c ® m as a direct 
sum of ideals. To show that m is semisimple let h be an Abelian ideal of m . 
Then using g = c ® m we have 

[gh\ c [ ch ] + [mh] c h 

so that h is an Abelian ideal of g. However, from assumption (b) we have 
kc and therefore h a c n m = {0}. Consequently by Corollary 12.23(b), 
m is semisimple. Thus since m = [mm] we have g' = [gg] = [c + m c 4- m] — 
[mm] = m. 

Proof of Theorem 12.32. We first show conditions (a) and (b) above 
hold for the irreducible case, for let g act irreducibly on V and let 

Tseng'. 

Then T = £ U* > B kl and [T, A k ] = 0 (since T e c ). By Lemma 12.34 we 
have T as nilpotent. Thus since every element of the ideal eng' is nilpotent 
we have by Lemma 12.33 that c n g' = {0}. Thus condition (a) of Lemma 
12.35 is satisfied. For condition (b) let h be an Abelian ideal of g and let 
T e [h, g] which is an ideal of g. Then 

T = Z[A k ,B k ] 

for A k e h and B k e g. Since h is an ideal we have Teh and since h is Abelian, 
IT, A k ] = 0. Thus, by Lemma 12.34, 7is nilpotent and as before, from Lemma 
12.33, [ h , g] = {0}. Therefore h<= c. 

Next assume V is ^-completely reducible with direct sum decomposition 

K= Vi + ••• + V r 

where each V t is ^-irreducible. The set 

k i = {Aeg:AV i = {Q>}} 

is an ideal in g and note that Hi=i = W- Let g t = gik ; and let c i be the 
center of g t . Then V t is an irreducible ^.-module (where the action is defined 
by TX = TX for T = T + k t eg t and X e F,). Now conditions (a) and (b) hold 
for g t on V t \ that is, for the irreducible case, and they hold for g on V as 
follows. 

(a) Let A e g’ n c. Then for each i = 1,..., r 
A + kf eg n c f = {0J. 
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Thus A ek i9 that is, A e f] k { = {0} so that g' n c = {0}. 

(b) If h is an Abelian ideal of g, then R = h + k t is an Abelian ideal of g ( . 
Therefore h c: c f so that [H, g t ] — {0;}. Thus [h, g] a k t for f = 1,..., r which 
yields [A, #] <= pj k t = {0} and therefore kc. 

We now show that the endomorphisms of c are semisimple, for suppose 
A e c is not semisimple. Then from Section 12.2 the algebra K[A] generated by 
A has a nilpotent ideal R. However, since A commutes with every endo¬ 
morphism of the associative algebra g* 9 the set N = Rg* is an ideal of g* 
which is nilpotent. To see the latter we just note N 2 = Rg * * Rg* cz R 2 g* so 
by induction N k a R k g * = {0} for large enough k . Since V is a completely 
reducible ^-module, V is also a completely reducible #*-module. Thus from 
Proposition 9.15, g * is semisimple and therefore we must have N = {0}; that 
is, A = 0. 

To show the converse part of Theorem 10.32 we assume conditions 
(a) and (b) hold and note from Proposition 12.31 we can assume the field K is 
algebraically closed. By (b) and Theorem 11.14 we have the weight space 
direct sum decomposition 

K=F(W+- + F(W 

where V(<j> k ) = {X e V : for all T e c implies (T - 4> k (T)l)X = 0}. By (a) we 
have c is the center of g = c © g' and therefore the weight spaces V(<t> k ) 
are ^-invariant. However, the ^-submodules of each are the same as the 
^'-submodules. Thus if#' is semisimple, we can apply Weyl’s theorem (12.30) 
to conclude each K(<£ k ) is ^-completely reducible. By the above direct sum 
decomposition V is ^-completely reducible. 

If g' is {0}, then each of the weight spaces V((f) k ) is c-completely reducible 
(why ?) so that V is actually ^-completely reducible. 

Remark (2) The preceding direct sum decomposition g = c © g' has 
the following generalization; see the proofs of Jacobson [1962] and Serre 
[1965]. 

Theorem (Levi-Malcev) Let g be a finite-dimensional Lie algebra 
over the field K and let r — rad g . 

(a) There exists a subalgebra h of g such that 

g = r + h 

is a subspace direct sum. Furthermore h ^ gfr is semisimple or {0}. The de¬ 
composition g ~ r 4- h is called a Levi decomposition. 

(b) Let g = r + h = r + kbe two Levi decompositions of g. Then there 
exists (j> e Aut(#) such that <f>(h) = k . Furthermore the automorphism <f> can 
be chosen to be of the form 0 = exp(ad X ) where X e [gg] n r. 
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We can use this to prove the following variation of a result of Lie [Cohn, 
1957, p. 99; Hochschild, 1965, p. 133]. 

Theorem Let g be a finite-dimensional Lie algebra over R. Then there 
exists a real Lie group G whose Lie algebra is g. 

Briefly, we write a Levi decomposition g = r + h and note that this is a 
semidirect sum. Next we use the result of Section 10.2 which states that the 
solvable Lie algebra r is the Lie algebra of a solvable Lie group R . Thus since 
the semisimple algebra A is such that the adjoint representation is faithful, 
then the group H generated by exp(ad A) has a Lie algebra isomorphic 
to A. Therefore the semidirect product G = Rx H has Lie algebra (iso¬ 
morphic to) g. 

The above result also follows from Ado’s theorem which states: If g is a 
finite-dimensional Lie algebra over the field K , then g has a faithful finite¬ 
dimensional representation. Thus if p : g^gl(V) is such a faithful representa¬ 
tion of a real Lie algebra, then the subgroup G generated by exp p(g) a 
GL(V) has Lie algebra (isomorphic to) g. 


5. More on Radicals, Derivations, and Tensor Products 

In this section we use preceding results to discuss relationships between 
radicals of Lie and associative algebras, the complete reducibility of tensor 
products, and derivations of simple nonassociative algebras. Following 
Jacobson [1962] we have the next lemma; compare with Lemma 12.33. 

Lemma 12.36 Let V be a finite-dimensional vector space over the field K , 
let g <= gl(V) be a Lie algebra of endomorphisms, and let A be an ideal of g 
such that every endomorphism in A is nilpotent. Then the associative algebra 
A* generated by A is contained in the radical N of the associative algebra g*. 

Proof The subspace n = g*h* + h* is an ideal of g* noting first 
g*n a #*(#*A*) + g*h * a g*h* a n. 

Next using AX — XA e A for all A e A, X e g we have by an easy induction 
AA...A#<=#A...A + A...A. 
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Therefore A*#* c: #*A* + A* and 

ng* d g*h*g * + A*#* 

<= #*(#*A* + A*) 4- g*h * + A* <= 

Also « k c: #*A* + (A*) k so that from Corollary 11.11 to Engel’s theorem A* is 
nilpotent and therefore for k large enough n k d #*A*. Now by induction we 
can show 

(A*) r #* c= g*(h*) r + (A*) r 

and consequently 

(g*h*) r a g*(h*) r . 

Using this and h* as nilpotent we see that (n k ) r d g*(h*) r — {0} for suitably 
large r . Thus the nilpotent ideal n is contained in N\ that is, A* c: n d N. 

Lemma 12.37 Let V be a finite-dimensional vector space over the field K 
and let g <= gl(V) be a solvable Lie algebra. If N is the radical of g *, then 
g*/N is a commutative (associative) algebra. 

Proof First assume g* is semisimple. Then V is a completely reducible 
#*-module (Theorem 9.16) so that V is a completely reducible ^-module. 
Thus from Theorem 12.32 we conclude that g = c © g' where#' is{0} or semi¬ 
simple (and therefore g f — [g\ #']). However, since g is solvable we must have 
#' = {0}. Thus g*/N — #*/{0} = c* is a commutative algebra. In particular, g 
is an Abelian Lie algebra. 

For the general case we note that since N is an ideal of#* and since as sets 
#* <=: gl(V) = End(F) we can regard N as solvable Lie subalgebra of gl(V); 
still denoted by N. Thus since # is solvable, the Lie algebra (# + N)/N is 
solvable and the associative algebra generated by this Lie algebra is #*/A 
which is semisimple. Thus from the preceding paragraph (# + N)/N is an 
Abelian Lie algebra which means in terms of cosets ( X + N)(Y + N) = 
(Y + N)(X + N) for endomorphisms X , Y e #. However, these cosets 
generate g*/N so that g*/N is commutative. 

Theorem 12.38 Let # be a Lie algebra of endomorphisms acting on the 
finite-dimensional vector space V over the field K and let r be the radical of # 
and N be the radical of #*. Then: 

(a) # rv N is the set of nilpotent endomorphisms in r; 

(b) [r, #] c N. 

Proof Let r 0 be the set of nilpotent endomorphisms of r. Then since N 
is an associative nilpotent ideal, we have g n N c r. Since the endomorphisms 
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of N are nilpotent g n N c: r 0 . If rad(r*) denotes the radical of the associative 
algebra r* generated by r, then from Lemma 12.37, r*/rad(r*) is commutative. 
However, this implies r*/rad(r *) has no nonzero nilpotent elements as follows. 
A nonzero nilpotent element in the commutative algebra generates a nilpotent 
ideal and therefore is in the radical of the algebra; but r*/rad(r*) has 0 
radical. Thus rad(r*) equals the set of nilpotent elements of r* and therefore 
r 0 <= rad(r*). From this we obtain 

r 0 — rn rad(r*) 

and in particular r 0 is a subspace of g. 

We shall next show r 0 is an ideal of the Lie algebra g> For this we use the 
characterization of the radical of an associative algebra A as 

rad A = {Z e A : f(Z, X) = 0 for all X e A} (*) 

where /(X, Y) = trace L(X)L(Y); see Proposition 12.14. First for any 
X , Y,Z e A we have 

/([Z, XI Y) = /(Z, [.X, 7]) 

where [Z, X] = ZX — XZ. Now let A = r* and since r is an ideal in g we 
have for Z eg and X, Y er that 

[Z, XY] = [Z, X]Y + X[Z, 7] e r* 

so that by induction 

[g, r*] <= r*. 

Now let U e r 0 = r n rad(r*) and let Z eg. Then [Z, £/] € r and we now show 
[Z, £/] e rad(r*) as follows. For any X e r* 

/(X, [Z, £/]) = /([X, Z], tf). 

However, [X, Z] e [r*, cz r* and e rad(r*) so that from formula (*) 
we have f([X, Z], U) = 0. Thus /(X, [Z, £/]) = 0 so that from (*) we obtain 
[Z, £/] g rad(r*). Therefore [^, r 0 ] c: r 0 . 

Since r 0 is an ideal in g all of whose elements are nilpotent we can apply 
Lemma 12.36 to conclude r 0 <= rad(#*) = N. Thus since r 0 ag we have 
r 0 a N n g and therefore r 0 = N n g which proves part (a). 

For (b) we use the fact that if L is a Lie algebra of endomorphisms so 
that L* is semisimple, then rad L is contained in the center of L; this follows 
from Theorem 12.32. We apply this to L = (g + N)/N as discussed in Lemma 
12.37 and note that the associative algebra generated by L is L* =g*jN 
which is semisimple or {0}, If L* is{0}, then we are done. Otherwise the radical of 
(g + N)/N is in the center of (g + N)/N. However, (r -f N)/N is a solvable 
ideal of (g + N)/N and therefore in the radical of ( g + N)/N. Thus 

[(r + AT)/X,(^ + X)/X] = {0}; 


that is, [r, g] c= N. 
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Exercise (1) Show how (# + N)/N in the above proof is actually a Lie 
algebra of endomorphisms. 

Corollary 12.39 Let # be a finite-dimensional Lie algebra over the field 
K . Then 

rad(#) = {X eg : Ki\\(X, [AB]) = 0 for all A, Beg}; 
that is, rad(#) = g' 1 relative to the Killing form of g. 

Proof Let r — rad(#) and let A e r and X, Y eg. Then Kill(^, [TY]) — 
Kill([^2f], Y). However, from Theorem 12.38 we have that 

ad [AX] = [ad A , ad X] e [rad(ad g ), ad g] 

which is contained in the radical of the associative algebra (ad#)*. Thus 
ad [AX] ad Y is in the radical of (ad #)* because the radical is an ideal. 
Since this radical consists of nilpotent endomorphisms, 

0 = trace ad[AX] ad Y = Ki\\(A, [XY]). 

Thus A e g' 1 so that r c= #' x . 

For the other inclusion we first note that h — #' x is an ideal of g. Next if 
X eh! = [hh], then we have 0 = KillCY, X) = trace(ad X) 2 . Therefore by 
Cartan’s criterion (Corollary 12.21) we have h is a solvable ideal of g; that is, 
g fl c r. 

Recall [Section 11.2, exercise (5)] that the nilpotent radical n of a finite¬ 
dimensional Lie algebra g is the maximal nilpotent ideal of g. 

Corollary 12.40 Let g be a finite-dimensional Lie algebra over the field K 
and let r be its radical and n its nilpotent radical. Then [rg] cz n. 

Proof From Theorem 12.38 we have [adr, ad#] is in the radical of 
(ad #)*. Since this radical is associative nilpotent, there exists k so that for any 
A u ..., A k e r and X l9 ..., X k eg that 


0 = [ad A- l9 ad X t ] • * * [ad A k , ad X k ] 

~ ad [A x X x \ ad[A k X k ]. 

Thus by Corollary 11.12, the Lie algebra [rg] is nilpotent. However, [rg] is 
an ideal of # (using the Jacobi identity) so that [rg] a n. 

We generalize this result as follows: 
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Corollary 12.41 Let g be a finite-dimensional Lie algebra over the field 
K and let r be the radical of g and n the nilpotent radical of g. Then every 
derivation of g maps r into n. 

Proof Let D be a derivation of g . Then from r = g' 1 we have for X e r 
and A y B eg that 

Kill (DXy [AB]) = -Kill(X, D[AB])= 0 

where we use 

D[AB) = [DA B] + [A DB]eg' and Kill {DXy Y) = -Kill(X, DY) 
from Corollary 12.3. Thus Dr a r. 

Next we shall show that the subspace h = r' + Dr is a nilpotent ideal of r. 
Clearly h is an ideal of r and let 

L = rxKD = r@KD 

be the (external) direct sum of r and the one-dimensional space KD. Define 
a multiplication on L which extends the multiplication of r by 

[X + aDy Y + bD] = [XY] + aDY - bDX. 

This makes L into a Lie algebra in which r is an ideal. However, L/r is one¬ 
dimensional and therefore solvable. Since r is solvable, this implies L is 
solvable. By Proposition 11.9 Z/ = [LL] is a nilpotent ideal in L, but 

L' = r' + Dr 

is contained in r; that is, the subspace h = r' + Dr is a nilpotent ideal of r. 
Next since the nilpotent radical n of g is a nilpotent ideal of r we have that 

k = h + n 

is a nilpotent ideal of r. However, k is an ideal of g as follows. Since n and r’ 
are ideals of g it suffices to show [Dr g] c k . For X eg and ^erwe have 

[DA X] = D[AX] - [A DX] 

where D[AX] e D[rg] cz Dr and [ADX] e [rg] cn by Corollary 12.40. 
Thus [Dr g]a Dr + n a k so that k is actually a nilpotent ideal of g\ that is, 
r' H- Dr + n cz «. Since r' <zn we obtain Dr a n. 

As another application of Theorem 12.32 on complete reducibility we 
have the following results on derivations due to Jacobson for algebras with 
an identity element [Schafer, 1966]. For a nonassociative algebra A, recall 
that the Lie transformation algebra L(A) is the Lie algebra generated by the 
left and right multiplication endomorphisms of A (Section 7.2). 
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Proposition 12.42 Let A be a finite-dimensional nonassociative algebra 
over an algebraically closed field K of characteristic 0 such that A is a direct 
sum of ideals which are simple subalgebras. If D is a derivation of A, then 
D = U + al where U e [L(A), L(A)], I is the identity endomorphism, and 
a e K. 

Proof If A = ^(1) © • • • © A(n) is the given direct sum decomposition 
into simple ideals, then analogous to the proof of Corollary 12.26 we have for 
i = 1, ..., n that 

DAQ) = D(A(i)A(i)) <= A(i)(DA(i)) <= A(i). 

Thus D : A(i) -> A(i) induces a derivation of A(i). Also using the corresponding 
decomposition L(A) = L(A(1)) + • • • + L(A(n)), it suffices to consider the 
case when A is a simple algebra. 

Let if = L(A) be the Lie transformation algebra of A which is generated 
by the left and right multiplications L(X) and R(Y) for all X, Y e A. Since A 
is simple if acts irreducibly on A. Therefore according to Theorem 12.32, 
Z£ = C © it" . Now since C is the center of if, any endomorphism T e C 
commutes with an irreducible set of endomorphisms and therefore T = al 
where a e K, using Schur’s lemma [Proposition 9.14(a)]. Thus C equals KI 

or {0}. 

From [D, L(X)] = L(DX) and [D, R(X)\ = R(DX) we see [D, if] c if. 
However, since if = C © if' with C as above we obtain [D, if'] c if'. 
First we assume if' ¥= {0}. Then the mapping 

5 : if' if': P [D, P] 

is a derivation of the semisimple Lie algebra if'. Thus there exists U e if' 
such that D = ad U. Thus for any F = 6/ + F' e if = C © if' we see 

[A V] = [A V] = Dv = ad U(V) = [U, V] 

so that [D — U, V] = 0. Thus D— U commutes with an irreducible set of 
endomorphisms which implies D — U — al. 

In case if' = {0} we see if = KI. Thus for any X e A, L(X) = al and 
R(X) = pi so that 0 = [A L{X)] = L(DX) and similarly R(DX) = 0. Since 
A is simple, this implies DX = 0; that is, D = 0 which is in if. 

Exercises (2) Let A be a simple nonassociative algebra over K and let 
X e Abe such that L(X) = R(X) = 0. Show X = 0. 

(3) Let A be a simple nonassociative algebra over an algebraically closed 
field of characteristic 0 and let D be a derivation of A. 

(i) If A contains an identity element, then show that D e L(A). 

(ii) If trace D = 0, then show that D e L(A). [This is the case for Lie 
algebras since Ki]|(AY, Y) = -KillfA, DY).] 
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Recall from Section 9.4 that if g is a Lie algebra over the field K and V 
and W are ^-modules with corresponding representations p and a, then the 
vector space tensor product V ® W becomes a ^-module where the action 
of g is defined by 

A(Z X 0 Y) = I P(A)X ® Y + X ® c t(A)Y. 

Proposition 12.43 Let g be a Lie algebra over the field K and let V and 
W be completely reducible ^-modules. Then the tensor product V® IP is a 
completely reducible ^-module. 

We outline a proof as follows. From Proposition 12.31 we can assume K is 
algebraically closed. 

Exercises (4) Show the (external) direct sum P = V © W is a ^-module 
which is completely reducible. Also show V ® W is a submodule of P ® P. 
Thus since a submodule of a completely reducible module is also completely 
reducible, it suffices to show P ®P is completely reducible. 

(5) (i) Let t be the completely reducible representation of g in P and 
assume x is faithful. Show that g = c © g' (direct sum) where c is the center 
of g and g' = [gg] is semisimple or {0}. 

(ii) Show that it suffices to assume t is faithful in (i) for the proof of the 
proposition. 

(6) (i) Let f be the representation of g in P ® P and show that 
x(g) = z(c) © z(g') where ?(c) is the center of x(g) and z(g') is semisimple or {0}. 

(ii) Recalling K is algebraically closed, show that for any Aec, the 
endomorphism f (A) = x(A) ® 1+ I® x(A) is semisimple. [If X, Y eK are 
characteristic roots of x(A), what are the characteristic roots of f(^)?] Thus 
by Theorem 12.32 conclude the Proof of Proposition 12.43. 


6. Remarks on Real Simple Lie Algebras and Compactness 

We combine the results of Section 9.2 on complexification with some of the 
results on semisimplicity to obtain the form of a real simple Lie algebra. 
Analogous to the last part of the proof of Cartan’s criterion for solvability, 
we have the following result. 

Lemma 12.44 Let g be a Lie algebra over K, and let P be the algebraic 
closure of K. Then g is semisimple over K if and only if g = P ® K 9 1S a semi- 
simple Lie algebra over P. 
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Proof Let X x ,..., X m be a basis of g over K , and let b {j = Kill^, X } ), 
let ..., X m be the corresponding basis for g, and let E u = Kill(^ £ , 
Then using the computations analogous to the last part of the proof of Cartan’s 
criterion for solvability [preceding exercise (2)], we see b tj = 5 t7 . Thus g 
(respectively g) is semisimple if and only if its Killing form is nondegenerate 
which is the case if and only if the matrix (6^) [respectively (5 M )] is non¬ 
degenerate. This yields the result, since b {j = . 

For a real Lie algebra g , we see that g is simple over R implies g — C ® g 
is semisimple over C. From Section 9.2 we notice that g need not be simple. 
However, we have the following result. 

Theorem 12.45 Let g be a simple Lie algebra over the reals R . Then g is 
isomorphic to the realification of a simple complex Lie algebra, or g is 
isomorphic to a real form of a simple complex Lie algebra. 

Proof If g = C ® g is simple, then g is a real form for the simple complex 
Lie algebra g. For the other case we have g is not simple, but, from Lemma 
12.44 g is semisimple. Let h be a proper simple ideal in g , and let C be the 
conjugation given by C : g g : X 4- iY -► X — iY. Then it is easy to see that 
h n C(h ) is an ideal in g , and since g is simple, h n C(h) = {0} or h n C(/i) = 
g. In the first case, the set {X + C(X) : X e h} is an ideal in g and therefore 
equals g. Thus g = h © C(h ) using h n C{h) = {0}. Therefore g © g £ g R ^ 

© (CW)r = Ar © An. This shows g is isomorphic to the realification h R of 
the simple complex algebra h. The case h n C(h ) = g is left as an exercise. 

Remark (1) From Theorem 12.45 we see that in order to find the real 
semisimple Lie algebras, it suffices to find the simple complex Lie algebras 
and then find their realifications and real forms. We do this in Chapter 15 
noting that the “realification” part of the problem can be done as follows. 

Proposition 12.46 Let g be a simple real Lie algebra which is not iso¬ 
morphic to a real form of a simple complex Lie algebra. Then g is isomorphic 
to the realification of a simple complex Lie algebra, and conversely every 
realification of a simple complex Lie algebra is a simple real Lie algebra. 

Proof It suffices to show the converse. Thus let g be a simple complex 
Lie algebra, then clearly g R is semisimple. Now let h be a nonzero ideal in 
Sr - Then h is semisimple, and therefore h = [hh\ Now for X, Y e h cz g we 
have by definition of scalar multiplication in g that i[X Y] = [iXY] e [gh\ a h 
noting g = g R as sets. Thus ih = i[hh] c: h so that h is closed relative to scalar 
multiplication by complex numbers. Thus h is an ideal of the simple complex 
Lie algebra g so that h = g; that is, h=g R . 



6. REMARKS ON REAL SIMPLE LIE ALGEBRAS AND COMPACTNESS 


Remark (2) We shall see in Section 15.1 how real forms are related to 
compactness, which we now discuss. 

Theorem Let G be a connected real semisimple Lie group with Lie 
algebra g. 

(a) If G is compact, then its universal covering group is compact. 

(b) The group G is compact if and only if the Killing form of g is neg¬ 
ative definite. 

For the proofs we refer to the work of Hausner and Schwartz [1968] and 
Helgason [1962]. Thus we say a Lie algebra is compact if its Killing form is 
negative definite. For example, the compact Lie group SO(n ) has Lie algebra 
so(n) of skew-symmetric matrices and Kill(2f, Y) = (n — 2) trace XY. How¬ 
ever, since 0 # X is skew-symmetric, we see trace XX < 0. 




CHAPTER 13 

CARTAN SUBALGEBRAS 
AND ROOT SPACES 


In this chapter we start the detailed analysis of semisimple Lie algebras over 
a field K of characteristic 0. In the first section the Cartan subalgebra h of a 
Lie algebra g is introduced. Since h is a nilpotent Lie subalgebra of g, we 
consider the weight space decomposition of g relative to the nilpotent Lie 
algebra of endomorphisms ad 9 h. Thus in the second section we consider the 
relationships between these weight spaces for a split semisimple Lie algebra 
and give examples. It turns out that a split semisimple Lie algebra g is a sum 
of simple subalgebras which are isomorphic to sl( 2, K). Consequently, in the 
third section we discuss the representations of 57(2, K) and apply this to a 
further analysis of the weight spaces of g. 


1. Cartan Subalgebras 

All Lie algebras in this chapter will be assumed to be finite-dimensional 
over a field K of characteristic 0. Recall that a Lie algebra g over K is said to 
be “ split ” if for each Xeg all the characteristic values of ad X are in K. All 
algebras over an algebraically closed field, in particular the complex numbers, 
are split. 
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Definition 13.1 Let g be a Lie algebra over K. 

(a) For any subalgebra h of g let N(h) = {Xeg : [Xh] e /;}, then N(h) 
is called the nonnalizer of h and is the largest subalgebra of g which contains 

h as an ideal. . , 

(b) A subalgebra h of g is called a Cartan subalgebra if it is mlpotent and 

N(h) = h. 

For any X eg and A e K set 

g(X, X) = {Yeg : (ad X - Xl) n Y = 0 for some it}. 

Notice that if A is a weight of ad X , then g(X, X) is merely a weight space 
of g for the linear transformation ad X. If A t , • • •, A m , the weights of ad X, all 
lie in K, then it is clear that g = g{X,, X) + • • • + g(X m , X) as a vector space 
direct sum; recall Section 10.3. From exercise (2) of Section 11.3 we have 
that if A + g is a weight, then 

[g(X,X)g(g, X)]^g(X + g,X) 

which implies #(0, X) is a subalgebra of g and [tf(0, X)g(g, X)] <= g{g, X). 
Briefly, to see this we note that D = ad X is a derivation of g. Then for 
U e g{X, X) and V e gig, X) 

(D — (A + g)l)[UV] = [(£> - XI)U V ] + [U{D- gI)V]. 

Thus by induction we obtain 

(£> - (A + g)l) p [UV] = t o (fy[( D - M) k U(D - giy~ k V] 

and for p large enough, the right side of this equation is 0; that is, 
[UV] e g(X + p, X). Note that if X + g is not a weight of ad X, then 

I [g{X, X)gig, X)] - {0}. 

An element X e g is called a regular element of# if the dimension of #(0, X) 
is minimal. This minimal dimension is called the rank of g. 

Example (1) Let g = gl( 2, K ) the Lie algebra of 2 x 2 matrices over K. 
As usual let / denote the identity matrix 



One can compute that #(0, /) = #(0, E) = g so / and E are not regular. But 
^ = YU + kh which is equal to the diagonal matrices of g and it is not 
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too difficult to show that H is regular, g(0, H) is a Cartan subalgebra of g , 
and so K) has rank Z 

Proposition 13.2 Let g be any finite-dimensional Lie algebra over a field 
K of characteristic 0, X e g a regular element, and h = #(0, X ). Then h is a 
Cartan subalgebra of g. 

Proof We must show that h is nilpotent and that N(h) ~ h. By Engel’s 
theorem we can show that h is nilpotent by showing that for all H e /*, ad H 
acts as a nilpotent linear transformation on h . 

Let p(t) = det(ad X — tl) = fq(t) be the characteristic polynomial of ad X 
where r = dim h is the rank of g and t and does not divide q(t). We then have 
h = {Yeg : (ad X) r Y = 0} and if we let k = {Yeg : q(ad 2 Q(T) = 0 }, the 
primary decomposition theorem of linear algebra tells us that g = h + k y a 
vector space direct sum, and that h and k are invariant under ad X ; that 
is, (ad X)h c= h and (ad X)k cz k . In order to show that h is nilpotent 
and N(h) = h we can extend the field K to its algebraic closure if need be 
and so we may assume that all the weights of ad X lie in K. In this case k = 
ffd* X) where the sum is over nonzero weights of ad X. 

Now assume Y = H + Y t + Y 2 -+-•••+ Y m e N{h) where Y { e ^(2 f , X\ 
Af # 0, and H e h = ^(0, X). Since Xe h we have 

[XY] = [XH] + [XY t ] + • • • 4- [XY m ] e h. 

Thus using the above direct sum g = h + k and using the ad JiT-invariance 
of 0 (Af, X) we see [XY t ] = 0. Thus Y t sg(X i9 X) n h ={0}. Consequently 
Y = Heh and N(h) ** h. 

To show that ad H is nilpotent on h for all H eh we first choose a basis 
Xi , X 2 ,..., X n for g with X t , X 2 ,..., X r e h and X r+1 ,..., X n e k. With 
respect to this basis the matrices associated with the linear transformations 
ad X and ad H look like 


ad X — 

i O' 

ad H = i 

: O' 

C 

1 By 

c 

1 D . 


where we know that ^isanr x r nilpotent matrix and B is an (n — r) x (n — r) 
nonsingular matrix, and we want to show that the r x r matrix C is nilpotent. 
Assume to the contrary that C is not nilpotent and let jc, y, and t be three 
indeterminants. Define 

p(x,y, t) = dct(xad X + ^ad H - tl) = det (xA + yC - tl) det(x£ + yD - tl). 

Then B nonsingular implies t does not divide det(B - tl) nor does t divide 
det(xB + yD - tl\ and C not nilpotent implies f does not divide det(C - tl) 
nor does f divide det (xA + yC — tl). Thus f does not divide p(x , y, t). 
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However, then one can find a, P e K so that f does not divide p( a, /?, 0 = 
det(ad(aA'+ fM) - tl) which is the characteristic polynomial of ad(aA" 4- PH). 
This contradicts the fact that X is regular and completes the proof. 


Corollary 133 Every Lie algebra possesses Cartan subalgebras. 

Proposition 13.4 If A is a Cartan subalgebra of a Lie algebra g and X eh 
with X regular, then A = g( 0, X). 

proof For any Yeh, (ad X) m Y = 0 for some m because h is nilpotent 
so y e ^(O, X) and h £ 0(0, X). 

Now assume that h * 0(0, X). Then N(h ) = h implies that (ad Z)h £ A for 
any Z e 0(0, X) with Z £ h. However, then by induction, for any positive 
integer n, there exist H t , H 2 , • • •, e h with 

and this contradicts the nilpotence of 0(0, X); note Proposition 13.2. 

Proposition 13.5 If 0 is a split Lie algebra over K and A is a Cartan sub- 
algebra of 0, then A = 0(0, X) for some regular Xeh. 

Proof By the previous proposition we need only show that A contains a 
regular element. Since g is split and since ad # (A) is nilpotent g = 
h + g 1 + • • ■ + g m , a vector space direct sum of weight spaces of the weight 
functions of ad 9 (A); Section 11.3. The condition N(h) = A guarantees that A 
is precisely the weight space of the weight function which is identically 0 on 
ad (A). Thus for each i = 1, 2,..., m there exists an H, 6 A with ad H t acting 
as a nonsingular linear transformation on 9i . Now a standard argument over 
infinite fields can be used to choose an H e A equal to a linear combination o 
the H t ' s such that ad H is nonsingular on g t + • • * + g m ■ Then 0(0, H) = A 
and H is regular as required. 


Remarks (1) This proposition can be used to help prove that if K is 
algebraically closed, then any two Cartan subalgebras of g are conjugate by 
an automorphism of 0 [Jacobson, 1962, p. 273; Hausner and Schwartz, 1968]. 

(2) In the next section we consider split semisimple Lie algebras so by 
Proposition 13.5 in this case all Cartan subalgebras will contain regular 
elements and the dimension of any Cartan subalgebra will be equal to the rank 


of the algebra. . 

(3) Let A be a Cartan subalgebra of g. Then since A is nilpotent, ad g A is a 

nilpotent Lie algebra of endomorphisms acting cn g, for if n is an integer such 
that for all H l ,H 2 ,...,H n in A we have [ ■ ■ ■ [HtH 2 ] ■ ■ ■ H„] = 0, then 
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[ * “ [ad g , ad g H 2 ] • * *, ad s H n ] = ad g ([ * * • [Hiif 2 ] • • • H n ]) = 0. In the next 
section we use results of Section 11.3 on decomposing a vector relative to a 
nilpotent Lie algebra of endomorphism to find a weight space decomposition 
of g relative to ad^ h. 


2. Root Spaces of Split Semisimple Lie Algebras 


In this section g will always denote a split semisimple Lie algebra over K 
and h will always denote a fixed, but arbitrary, Cartan subalgebra of g. Since 
ad g h — {ad H : g^g : H eh) is a nilpotent algebra of split linear transfor¬ 
mations of g we know that g is a vector space direct sum of the weight spaces 
of the weight functions of ad ff h. 

Definition 13.6 The weight functions of ad ff h are called the roots of g and 
their weight spaces are called the root spaces of g ; note Definition 10.15. 

Remarks (1) Strictly speaking the weight spaces should be called the 
“ root spaces of g with respect to h but we will see later than the structure of 
root spaces does not depend on the choice of h . 

(2) Using the previous notation for weight spaces g(<x) will denote the root 
space of the root a. Proposition 13.4 implies that ^(0) = h . It will be very useful 
to express g as the vector space direct sum 

g = h + Y, #(«)> 

ae 9t 

where M denotes the nonzero roots of g. As in Section 13.1 we obtain for 
a, ft roots of h that 

r / ^ ,Q\ i f#( a + P) if a + P is a root, 

M«)M1 <=(f 0) othen>ise . 

Proposition 13.7 Let g be a split semisimple Lie algebra over K, h a 
Cartan subalgebra of g , and as usual let KilfiZ, Y) denote the Killing form 
of g. 

(a) If a, P are roots of g with a + p ^ 0 and X e ^(a), Y e g(p ), then 
Ki\\(X, Y) = 0. 

(b) If a e then -ael and dim(#( —a)) — dim(#(a)). 

(c) KilKJf, Y) restricted to h is nondegenerate. 

(d) [hh] = {0}. 
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Proof (a) Let T = ad Xad Y and notice that T\g(y)) c g(y + k (a + /?)) 
for those positive integers k for which y + k (a + /?) is a root of g . Since the 
number of roots are finite we have that, for some large enough k,y 4- k {a + P) 
will not be a root of # and T k (#(y)) = {0}. Since g is a direct sum of its root 
spaces, this implies that T is a nilpotent linear transformation so that Kill 
( X , Y) = trace T=0. Notice that we have included the possibility that a 
or P is the zero root and have used only the fact that a + ft ^ 0. 

(b) Suppose a e 0t but — a $ Then from (a) it follows that 
Kill(#(a), g(p)) = Oforall roots P of g and Kill(#(a), g) = 0 which contradicts 
the nondegeneracy of Kill^, Y) on#. In fact, general arguments about non¬ 
degenerate bilinear forms allow us to conclude that g(g) and g( - a) have bases 
X x , X 2 , ..., X n and Y x , Y 2 ,..., Y n , respectively, with Kill(Y f , Xj) = Kill 
(Y i9 Yj) = 0 for all ij and Kill(Y £ , Y^) = which is the Kronecker 
^-symbol. In particular, the dimensions of g( a) and #( —a) are the same. 

(c) 0 t ^ H x e h implies Kill^, £ aei a#(a)) = If also Kill(i?i, h) = 0, 
then Kil^JJj, g) = 0 contradicts the nondegeneracy of Kill(Y, Y); so there is 
some H 2 e h with Kil \(H X , H 2 ) =£ 0. 

(d) ad g h is a nilpotent and therefore a solvable Lie subalgebra of #/(#). 
By Cartan’s criterion for solvability we have 

KilKHj, [. H 2 H 3 ]) = trace(ad H x )(nd[H 2 H 3 ]) - 0 
for all H x , H 2 , H 3 e h; that is Kill(/z, [hh]) = 0. Now (c) implies [hh] = 0. 

Definition 13.8 (a) Let A* denote the dual space of A; that is, h* is the 
set of linear functions from h into K. Notice that is a finite subset of h*. 

(b) For each ae^ let H a denote the unique element of h such that 
K\W(H a , H) = oc(H) for all H eh. It is clear that such an H a exists because of 
an elementary result which states that the conclusion is true for any symmetric 
nondegenerate bilinear form on a vector space and any function in the dual 
space of the vector space. 

Proposition 13.9 Let g be any split semisimple Lie algebra with the 
notation as before. 

(a) Then Kil \(H X , H 2 ) = 'Z*e# n **( H M H 2 ) for all H x ,H 2 eh where n a 
is the dimension of #(<x). 

(b) Then M spans h*. 

(c) If Xeg(oc), Yeg(-oc), and (ad H)X = <x(H)X for all H eh, then 
[XY] = Kill(JY, Y)H a . 

Proof (a) KillC^ , H 2 ) = trace(ad H x ad H 2 ) and the formula follows 
by looking at the matrices for ad H x and ad H 2 in Jordan canonical form. 
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(b) Let feh*. Then using the remarks in Definition 13.8, we have an 
Hj-eh such that f(H) = Kill(i/j, H) for all H e h. However, from (a), 
f(H) = £ n a ot(H f )oc(H) so that 0Z spans h*. 

(c) For any H e h, 

Kill([AT], H) = Kill([ HXl Y) 

= ol{H) Kill(JT, Y) = Kill(Kill(X, Y)H a9 H) 

and the formula follows from the nondegeneracy of Kill(JF, Y) on h. 

Proposition 13.10 Let g be a split semisimple Lie algebra as before. Then 
the root spaces g( a) for a e0t are not only weight spaces but are in fact 
characteristic spaces; that is, [HX ] = oc(H)X for any H eh and Xeg( a). 

Proof Choose a basis for g so that the matrices for each ad H with H e h 
restricted to g(cc) look like 


'oc(H) * " 

0 a (H)_ 

For a fixed H eh write ad H = S + N where S is semisimple and N is nil- 
potent. Then S(J£) = ct(H)X for any X e g( a) and since [g{a)g(P)] ^ ^(a + p) 
one finds that 

5([Z7]) = (a + P)(H)[XY] = [a (H)XY] + [Xp(H)Y] = [S(Z) Y] + [ZS(7)] 

for all X e g(cc) and Y e g(P). This shows that S is a derivation of g and from 
Section 12.3, S = ad Z for some Z e g. By noticing that S — ad Z consist of 
the diagonal part of the matrix for ad H we find that S = ad Z commutes with 
all the matrices of ad 9 h and so [Zh] = {0} and Z e A'(/;) = h. We must show 
that S = ad Z = ad H ; that is, N = ad(Jf - Z) is 0. The matrix for ad(H - Z) 
is strictly upper triangular so Kill^ , H - Z) = trace(ad H y ad (H - Z)) = 0 
for all Hi e h, and H — Z — 0 by the nondegeneracy of Kill(JSf, 7) restricted 
to h. Thus ad (H — Z) = 0 as required. 

Remark (3) Notice that Proposition 13.10 implies that the formula in 
Proposition 13.9 (c) holds for all Xeg(oi). This formula and some facts about 
representations of sl{ 2, K ) will be used in the next section to prove some 
further properties of root spaces. First, however, this section will be concluded 
with some examples. 
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Examples (1) Let g = sl(n, K ) and let E u be as usual, the matrices in 
gl(n, K ) with 1 in the ith row and jth column and zeros elsewhere. We first 
compute the Killing form for slip , K). Since I commutes with everything in 
gl(n , K) and gl(n , K) = KI + sl(n, K ), the Killing form of K ) is equal to 
the Killing form of ^/(«, K) restricted to sl(n, K). Thus for any A - [a y ], 
J3 = [Ay] e# = sl(n 9 K), Kill(^, B) = trace(ad A ad 5) where the trace is 
computed by using the E t /s as a basis for ^/(«, K). 

(ad A ad B)£y = (ad A) £ (6 W E W - ^£ it ) 

*=1 

n n 

“ X! X! (Pmk bki E m j Gj m A*,- ^mk Ay* 

m = 1*=1 

This contributes £j* =1 (a iJk A* f + a kJ b Jk ) — o^-Ah - a tl A jV to the trace and if we 
sum over all 1 <i,j<n we find 

Kill(,4, B) = 2n £ Y^ciijbji- 2 ^ X = 2rt(trace(.4B)) 

i=ij=i 

since traced) = £" =1 a H = 0; note exercise (3), Section 12.2. 

Notice that Kill(^4, £) = 2«(traceG4U)) is nondegenerate which shows 
that sl(n , X) is semisimple. 

Let h = {£?=! j a n = 0} which is the set of all diagonal matrices 

of sl(n , K). It is easy to check that A is a Cartan subalgebra of g — sl(n , K) and 
so g has rank n — 1. The nonzero roots of g are ^ = {<x jk :j^k 9 \<j,k<n} 
where 



— Qjj — 


and the corresponding root spaces are 


£(«,*) = KE jk . 


Notice that — 0 ^ = <x kJ and that each g(oL Jk ) is one-dimensional. If we define 
H Jk = (Ejj- E kk )/2n, then, for any H = a u E u e h. Kill (H jk ,H) = 
ajj — a kk so H ajk — H Jk where H ajk is given in Definition 13.8. It is also very 
easy to check that the conclusions of Proposition 13.9 also hold for this 
algebra. Thus sl(n, K) will be an important example for later results and the 
above formulas will be used at that time. 

(2) Consider the derivation algebra of a split Cayley algebra # 

over K as described in example (3) at the end of Section 9.6. Thus = 
{D(A , x, y ): A e sl( 3, K ) and x, y ei^ — K 3 }, where 'Y' is the three- 
dimensional vector space of column vectors over K. The action of the 
derivations on # can be found in Chapter 9. However, the Lie multiplication 
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in is of more interest to us now and for convenience will be displayed 
once more. 

[D{A X , x x , y x ), D(A 2 , x 2 , y 2 )] = D(A 3 , x 3 , j> 3 ) 

where 

A 3 = [A x , A 2 ] + 3x l y 2 t - 3x 2 y t * + ((x 2 , y t ) - , y 2 ))I, 

x 3 = A t x 2 - A 2 x, - 2y t x y 2 
y 3 = -A l ‘y 2 + A 2 'y x + 2x x x x 2 . 

The Killing form for can be computed as in the previous example using 
any convenient basis for One finds that 

Kill(Z)(^,, x,, j-i), D(A 2 , x 2 , y 2 )) = 8 trace(A x A 2 ) + 24(x 1; y 2 ) + 24(x 2 , y x ) 

so is semisimple since the Killing form is clearly nondegenerate. 

To examine some of the other concepts of this chapter set g = and 
h = { D(H , 0, 0) : H e ^7(3, K), H diagonal}. For D(H , 0, 0) e /z the multiplica¬ 
tion simplifies to 

[D(H, 0, 0), D(A, x , j>] = D([ff, ^], Hx , -tfy). 

It is now easy to check that /* is Abelian and self-normalizing and so is a 
Cartan subalgebra of g , and # has rank 2. 

Let 


T 


O' 


'O' 

0 

, E 2 = 

1 

. e 3 = 

0 

0 


0 


1 


be the usual basis for tT = K 3 and let E ijy 1 < i,y < 3, be as in the previous 
example. Set H = YJ=ia ii E ii e sl( 3, X) so that a tl + a 22 + a 33 = 0. The 
roots and root spaces for g are now given in Table 13.1. 

Notice that after we discover that g is a direct sum of its roots spaces we 
then know that g is in fact a split Lie algebra or at least that all the characteris¬ 
tic roots of ad H for H e h are in K. It will become apparent in the next section 


TABLE 13.1 


1 

a t (H) 



1 

#11 #22 

KD(E 12 , 0, 0) 

, 0 , 0 ) 

2 

#11 —#33 

KD(E 13 , 0,0) 

KD(E 31 , 0, 0) 

3 

#22 — #33 

KD(E23 ,0, 0) 

KD(E 32 , 0, 0) 

4 

#11 

/O>(0, E x , 0) 

KD( 0 , 0 , 

5 

#22 

KZ>(0,£ 2 ,0) 

AT/>(0, 0, £ 2 ) 

6 

#33 

«>(o, e 3 , 0) 

0, £3) 
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that this implies that g is split. Notice that being a split Cayley algebra 
implies that is a split semisimple Lie algebra even though the definition 
of a split Cayley algebra did not even involve characteristic roots of a linear 
transformation. It is in fact known that for any Cayley algebra is 

split if and only if # is split as a composition algebra. The following exercises 
are related to the examples in Section 2. 

Exercises (1) For any integer n ^ 1 let 

'1 0 -O ' 

, _ 0 O n I n 

"it * 

.6 i n o n _ 

be a (2 n + 1) x (2 n + 1) matrix where O n and I„ are, respectively, nxn zero 
and n x n identity matrices. Let 38(n, K) be the Lie algebra of all (2 n + 1) x 
(2 n + 1) matrices A over K such that J n A'J„ = —A. The notation for 0t{n, K ) 
is chosen for historical reasons and will be explained in the next chapter. Verify 
the following facts about 3S(n, K): 

(i) 3S(n, K) is just the set of matrices 

0 u v 

-v‘ R S 

-u' T -R\ 

where u, v are n x 1 matrices, R, S, T are nxn matrices, S’ - —S,T ,= —T. 
The dimension of @){n, K ) is 2 n 2 + n. 

(ii) KillOL B) = (2 n - 1) trace AB for any A, Be ®{n, K ) and so 
3d{n, K ) is a semisimple algebra. 

(iii) The subalgebra h of @(n, K ) of diagonal matrices in @(n, K ) is a 
Cartan subalgebra. The rank of 38(n, K ) is n. 

(iv) Let H = ^ =1 a i+1 , i+1 (£ l+ i, l+ i - E i+n+1 , i+n+l ) be an arbitrary 
element of h. The roots and root spaces of g — iM{n, K) are described in 
Table 13.2. 


TABLE 13.2 



Root A 

Km 

Basis for #(A) 

a i.j> 

1 <*,/<», i^j 

Ol + l,i + l a J+l t J+l 

El + l,j +1 — Ej + n + 1. i + n + l 

fr.j. 

1 <i<j<n 

<*l + 1, 1 + 1 + a J + l t J+l 

^I + l, J + n + 1 — Ej + i t i+n + l 

yi.j > 

1 <i<j<n 

—Oi + 1,1 + 1 “ a J+t,J+l 

Ei + n + 1. f+ 1 Ej + n + i t f + i 

Si, 

1 < i < n 

“fli+i,i+i 

■E^l.t + l — ^i + n + l.l 

£i » 

1 < * < #i 

flf+i,i+i 

£i. i+« + i ^i + i. l 



274 


13. CARTAN SUBALGEBRAS AND ROOT SPACES 


(2) For any n ;> 1, let 



be the 2 n x 2 n matrix where O n and /„ are as in the previous exercise. Let 
2>(n, K ) be the Lie algebra of all In x 2 n matrices A over K such that 
L rt A'L n = — A. Verify the following facts about Qt{n, K): 

(i) 3>{n, K ) is just the set of matrices 



where R, S, T are n x n matrices and S' = — S, T‘ = —T. The dimension of 
@(n, K ) is 2 n 2 - n. 

(ii) Kill(/4, B) = {In — 2) trace AB for any A, Be @(n, K), so 3>(n, K ) 
is semisimple. 

(iii) The subalgebra of 3>(n, K) of diagonal matrices is a Cartan sub¬ 
algebra of 3(n, K). The rank of £?(«, K ) is n. 

(iv) Use (iv) of the previous exercise to produce a table of roots and 
root spaces of Qs(n, K). 

(3) For any n >. 1, let 



be the 2n x 2n matrix where 0„ and /„ are as before. Let <g(n, K ) be the Lie 
algebra of all In x 2 n matrices A over K such that M n A'M~ l = —A. Verify 
the following facts about #( n , K): 

(i) K ) is just the set of matrices 



where R, S, T are n x n matrices and S' = S, T' = T. The dimension of 
<#{n, K) is 2 +n 2 n. 

(ii) Kill(^B) = (2 n + 2) trace AB for any A, Be <#(>n , K) so %(n, K ) is 
semisimple. 

(iii) The subalgebra of <g(n, K) of diagonal matrices is a Cartan sub¬ 
algebra. The rank of n, K) is n. 

(iv) Let H = £" =1 a, ,(£, , - £ 1+ „ i+ „) be an arbitrary element of h. 
The roots and root spaces of g '= <g(n, K) are described in Table 13.3. 

Remark (4) The examples and exercises above include all but four of 
the simple split Lie algebras over a fixed field K of characteristic 0. The proof 
of this fact (which follows in the next chapter) includes many algebraic com¬ 
putations, but the reader should be assured that the above matrix computa- 
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TABLE 13.3 


Root A 

A (H) 

Basis for g{ A) 


Oi, i Qjtj 

E it j — Ej+ Htl+n 

1 <i<j<n 

at, t 4* a Jt} 

+ » + Ej t ! + „ 

y,.j, 1 <i<j<n 

— ai t i — Qj.j 

El + n,J + £j + n, i 

28 ,, 1 < i < n 

—2a u 

£t + n. f 

2e lt \<i<n 

2a u 

El, t + B 


tions comprise the bulk of the tiresome details needed in describing the simple 
Lie algebras. 

(5) The algebras 39(n, X), #(/i, K), and 0>(n, K ) are often described as 
the set of linear transformations that are skew with respect to certain bilinear 
forms. Thus let J be one of the matrices J„,L n , or M n appearing in the exer¬ 
cises and let X and Y be column vectors of the appropriate size so that 
B(X, Y ) = X'JY defines a bilinear form. Notice that B(X, Y) is nondegenerate 
in all three cases, that it is symmetric in the first two cases, and that it is anti¬ 
symmetric in the third case. For a linear transformation A on the given vector 
space of column vectors let A* be the unique linear transformation such that 
B(AX, Y) = B(X, , 4 * T) for all and Y, one can easily check that A* = J~ 1 A , J. 
Then g, the Lie algebra of all linear transformations with A* = —A, is iso¬ 
morphic to the Lie algebras of matrices that were described in the exercises. 

(6) Summarizing some of the results for a split semisimple Lie algebra 
g we have 

9 = h + X g(a) = h + £>(a) + g(-a), 

where h is a Cartan subalgebra of g and a varies over 01. The second sum is not 
direct, since we are duplicating some root spaces. In the next section we will 
show h = £ KH„ for H a e h of Definition 13.8 and a varies over 01. Thus 

g-^KH' + gi*)+g(-«), 

and we shall show KB, + g(a) + g(- a) is isomorphic to sl( 2, K); that is, g 
is built up of three-dimensional Lie subalgebras which are isomorphic to 
sl( 2, K ). 


3. Irreducible Representations of sl(2, K) 

The following basic theorem completely characterizes the finite-dimen¬ 
sional irreducible representations of sl( 2, K). This theorem will be used to 
obtain further properties of roots of split semisimple Lie algebras. 
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Theorem 13.11 Let g be an abstract Lie algebra over K with basis H, E, 
F where [HE] = 2E, [HF] = —IF, and [EF] = H so that g is isomorphic to 
sl{ 2, K ). For any positive integer n>2 define a linear map p:g^> gl(n, K ) by 
setting 

~n — 1 0 

0 n — 3 
0 0 

P(H) = 

0 


"0 n - 1 0 

0 0 n - 2 

0 0 0 

p(E) = 

0 


"0 0 0 

10 0 0 

0 2 0 

P(F) = 

0 0 0 

0 n-2 0 0 

0 n — 1 0 

Then p is an irreducible representation of g. Conversely, given any finite¬ 
dimensional irreducible representation p : g -> gl(V), there exists a basis for 
V so that the matrices for p(H), p(E), and p(F) are precisely those above with 
n being the dimension of V. 

Proof A straightforward matrix computation will show that [p(H), 
p(E)] = 2 p(E), [p(H), p(F)] = —2p(F), and [p(E), p(F)] = p(H). Then p is a 
representation of g. To show that p(g) acts irreducibly on the n-dimensional 
vector space V of column vectors we first consider the usual basis for V. Thus 
A^is the vector with a 1 in the &th row and zeros elsewhere. Let X = Y,k=i x k A* 
be any nonzero vector in V. We must show that any subspace of V containing 
X which is invariant under p(g) must be equal to V itself. 
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Suppose x k — 0 for 1 < k < m < n but x m / 0. Then notice if m ^ n that 
p(F)X = kx k X k+1 = '£ kx k X k+x 

fc=l k-m 

so p(F)"~ m X = m(m + 1) • • • (« — \)x m X n . Thus any nontrivial invariant 
subspace of V contains X n . However, p(E) k X n — k\ X„. k so any invariant 
subspace contains all the vectors in the basis of V and must be V itself. We 
have shown that p is an irreducible representation of g and will now show that 
these are the only possible finite-dimensional irreducible representations. 

Conversely, given p, we first claim that there exists a 0 ^ X€ V with 
p(H)X = aXfor some a # 0 and p(E)X = 0. So first assume K is algebraically 
closed so that p(H) has a characteristic vector, say p(H)Y —bY. We will see 
shortly that b must be an integer so at that point we no longer need the as¬ 
sumption of algebraic closure. If p{E)Y = 0, we are finished. Otherwise let 
Y 2 = p(E)Y and notice that 

p(H)Y 2 = p(H)p{E)Y 

= (p([HE]) + p(E)p(H))Y 
= 2p(E) Y + bp(E) Y = (b + 2)Y 2 . 

If p(E)Y 2 # 0, set Y 3 = p(E)Y 2 and compute that p(H)Y 3 = (b + 4) Y 3 . In 
this way we obtain a sequence Y, Y 2 , Y 3 ,... of characteristic vectors of 
p(Fl) each with a different characteristic value so the vectors of this sequence 
are linearly independent. Since V is finite-dimensional there can be only a 
finite number of vectors in this sequence; that is, p(E) Y k = 0 for some k and 
Y k is the vector desired. 

Let X x be a vector as constructed in the previous paragraph so 
p(H)X x = aX x and p(E)X x = 0. For each positive integer k define X k = 
p(F) k ~ 1 X i /(k - 1)1 = p(F)X k -J(k - 1). As in the previous paragraph it is 
easy to verify that p(H)X k = (a-2k + 2)X k so only a finite number of the 
X k s are nonzero. Choose n so that X„ is the last nonzero vector in the sequence. 
The X u X 2 ,..., X„ form a basis for V since the following three formulas 
show that the subspace spanned by these vectors is invariant under p{H), 
P(F), p(F ) 

p(H)X k = (a-2k + 2)X k , 

p(F)X k = kX k+1 for 1 <k<n, p(F)X n = 0, 

p(E)X k = (a-k + 2)X k _ x for 2 < k < n, p(E)X x = 0. 

We must still prove the last formula. However, first notice that the formulas 
precisely coincide with matrix entries we are interested in if a = n — 1. 

To prove the formula for p{E) notice that 

p(E)X 2 = p(E)p(F)X i 

= ( p(H) + p(F)p(E))X x = aX x . 
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Now proceed by mathematical induction and assume the result for k - 1. 
Then 

(k - 1 )p(E)X k = p(E)p{F)X k _ 1 

= (p(H) + p(F)p(E))X k . l 
= (a-2k + 4)X k _i + (k- 2)(a - k + 3)^., 

= (k-\)(a-k + 2)X k . 1 . 

The desired formula now follows. Finally to show that a = n — 1 notice that 
the matrix for p{H) = [p(E), p{H )] is of trace 0 so a + (a - 2) + (a - 4) + • • • 
+ (a - 2n + 2) = 0 = na - n{n — 1). This completes the proof of the theorem. 

Example (1) If n = 2 in the theorem, one obtains the usual matrices 
for si(2. K). 

Exercise (1) The algebra 3t(\, K) described in the first exercise of the 
previous section is a three-dimensional algebra of 3 x 3 matrices and is iso¬ 
morphic to 5/(2, K ). An explicit isomorphism from the algebra g described in 
Theorem 13.11 to £0(1, K) is indicated by H -*• H', E-> E', F-+ F', where 



"0 

0 

0‘ 


“0 

0 

-2' 


- 0 

1 

O' 

H' = 

0 

2 

0 

, £' = 

2 

0 

0 

, F' = 

0 

0 

0 


0 

0 

-2 


0 

0 

0 


-1 

0 

0 


Show that the matrices of #(1, K) describe an irreducible representation of 
sl(2,K) of degree 3. Find a nonsingular 3x3 matrix P so that PH'P ~ 1 = p(H), 
PE'P 1 = p(£), PF'P 1 = p(F), where p(H), p{E), p{F) are as in Theorem 
13.11 with n = 3. 

Remark (1) For the remainder of this section we will return to in¬ 
vestigating split semisimple Lie algebras. The notation will be as in the 
previous section so g is a split semisimple Lie algebra over a field A" of charac¬ 
teristic 0 and h is a Cartan subalgebra of g. We know that g = h + E ^ g(a) 
as a vector space direct sum where denotes the nonzero roots of g and each 
root space g(a) is a characteristic space. We wish to find a subalgebra of g 
isomorphic to 5/(2, K). To find such a subalgebra we can use Proposition 
13.9(c) that says that for any Xeg( a) and Yeg(- a) we have [XY] = 
Kill(Z, Y)H a where Kill(H, H x ) = <x(H) for all H eh. 

Proposition 13.12 With the notation as above we have: 

(a) a(/7J = Kill(// a , HJ Y 0 for each a e 

(b) the dimension of g(tx) is 1 for each aeS; 

(c) if we define h( a) = [g(a)g( - a)] = KII X , then h(a) + g(a) + g( - a) is a 
subalgebra of g isomorphic to 5/(2, K). 




3. IRREDUCIBLE REPRESENTATIONS OF sl( 2, K) 279 

Proof (a) Suppose for a fixed a 6 01 we have oe(H a ) = 0. Then ehoose 
any Xeg(oc), Yeg(— a) so that Kill(A, T) = L Notice [H a X] = «(/£*)A = 0, 
[H a Y] = 0, [XY] ~ H*. Thus H a , X, Y span a three-dimensional nilpotent 
subaigebra of g and ad H a , ad X, ad Y span a three-dimensional nilpotent 
(and solvable) subalgebra of gl(g). Since ad H a = (ad X> ad Y ] and 
[ad H a , ad X] =0, we have by Lemma 12.34 that ad H a is a nilpotent 
linear transformation. By Proposition 13.10, ad H a is a diagonalizable linear 
transformation. A linear transformation which is both nilpotent anddiago- 
naiizable must be 0 so ad H a — 0. This clearly contradicts the semisimplicity 
of g . 

(b) and (c) For a fixed ael choose any X e g( a) and Y e g( — a) so that 
KM(X, Y) = 2/a(H a ) and set H = 2Hjoc(H a ). Then [HX] = oc(H)X = 2X, 
(i/yj = — 2 y, and [XY] = Kilt(A, Y)H a = H so g, the subspace of g spanned 
by H, X , y, is a subalgebra of g isomorphic to s7(2, X). To complete the proof 
we must show that g(<x) is one dimensional or equivalently that g( — a) is one 
dimensional. 

Suppose the dimension of g{ — <x) is greater than 1, then choose some 
0 ^ Z^g(-tx) with Kill(A", Z) =0; note the proof of Proposition 13.7(b). 
Then 

(ad X)Z = [XZ] = Kill( X, Z)H a » 0, (ad H)Z = [HZ] « - a(H)Z « - 2Z. 

This is precisely the situation described in the last part of the proof of Theorem 
13.11, except for an unfortunate clash of notation. We could follow the com¬ 
putations there to show that Z t = Z, Z 2 = (ad T)Z, Z 3 = (ad Y) 2 Z/2l, 

Z„ = (ad Y) n ~ 1 Z/(n — 1)! forms a basis for a subspace W of g invariant under 
ad X , ad y, and ad H. Furthermore we can conclude from previous com- 


putations that ad H restricted to W has a matrix 


"-2 

0 

0 


0 


0 0 

W) = 

0 

0 

-6 



0 

— 2 « 


where 2 = ad restricted to W. On the other hand 2(H) = [2(A), 2(F)] on W, 
so that trace 2(H) = 0. This is a contradiction which shows that g(—a) is one¬ 
dimensional. 

Definition 13.13 For each a e let 

H a f = 2HJot(H 9 ), 

where, as before, H a is the unique element of h such that Kiii(H a , H) = a(H) 
for all H e h . 
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Remark (2) Clearly [HJX] = 2X for any X e g{ a) and [H a 'Y]= -2Y 
for any Yeg( — a). Given any nonzero Xeg(a), there exists a unique 
Ysg(-oc) with [XY] = H a ' and then H a \ X, Y multiplies like the usual basis 
for sl(2, K). 

Definition 13,14 For any a, P e define <a, /?> = Kill(ff a , H fi ). Since 
spans h *, the dual space of h y this definition can be extended to a symmetric 
nondegenerate bilinear form <A, g ) on h *. 

Remark (3) Definition 13.14 can be thought of as a simplification 
of notation. Thus H a ' = 2H a /<a, a), and from Proposition 13.9 

<^> = z <*»*></*» «> 

a e # 

for any A, fx e &\. Now if the base field K is either the real numbers or the 
rational numbers, then for any p e 0t 

<P, P> = I <P, a> 2 > 0. 

ate & 

Thus <A, /*> is positive definite in this case. 

We make a few more observations. For A e A*, let A(if) = Kill(# x , //). 
Then if A = £ a, a f by Proposition 13.9, we have H k = £ for A(//) = 
X 0j = X a / Kill(// ai , H) — Kill(£ a f H ai , //) and the result follows. 
Next note for A, /i e A* that 

<A, /*> = KilK/f*, //„), 

for writing X = Y u a i <x i and ^ = £6^, we see <A, p> = Z fljfr/a,, a,> = 
Yj a ibj Kil!(//,., = Kilfi//*, H„) using the preceding paragraph. 

Proposition 13.15 If a, /? e iM and ff is not a multiple of a, then m = 
2<a, /!>/<a, a> is an integer, and /?,/?- a, /? — 2a,...,/?- wa are all in ^ if 
m>0 and /?, 0 + a, /? + 2a,are in ^ if m < 0. 

Proof Choose A'e^(a), Fe^(-a) so that H a ', A', 7 are a basis for a 
subalgebra of g isomorphic to sl( 2, X). Now ad ad X, and ad Y act on g, 
so these maps restricted to an irreducible submodule of g must have matrices 
like those of Theorem 13.11. Any root space of g must also be a weight space 
in an irreducible submodule of g; in particular, this is true for gi/i). For any 
Zeg(p) 

[H X ’Z] = P(2HJ(ol, a»Z = (2<a, £>/<«, a»Z = mZ. 

Thus m is a characteristic value of ad H tt '. Thus by comparing with p(H) in 
Theorem 13.11, m must be an integer. We are finished with the proof if m = 0. 
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If m > 0, then by again examining p(H) we find that m, m - 2, m — 4,..., 
—m are all characteristic values of characteristic spaces of &d(H a ') in the same 
irreducible subspace of g as g(P), in fact Z, (ad Y)Z, (ad Y) 2 Z ,.... (ad Y) m Z, 
form a basis for these spaces, and these elements lie in g(P), g(p — a), g(p - 2a), 

..., g(p - ma) as required. The same argument works for m < 0 by using 
ad X instead of ad Y. Also note exercise (4) of this section. 

Proposition 13.16 Let aj, a 2 .a„ e ^ be a basis of h* over K and let V 

denote the vector space over the rational numbers Q spanned by a„ a 2 ,..., 
a„. If a ,Pe&, then <a, /?> e Q and thus <A, /i> e Q for all A, geV. Also 
<A, /i> is positive definite on V. 

Proof By remark (3), < p , /?> = </*> «>* > 0 anc * therefore 

4 kp, py = 4 (p, pyKp, py 2 = £ 4 </?, *y 2 Kp, py 2 

aeM 

which is a sum of integers squared, using Proposition 13.15. Thus </?, /?> e Q 
I and since 2<a, /?>/</?, 0> is an integer, <oc, py e Q. Since X, fie V are rational 
linear combinations of elements of <A, p} e Q also. 

Now <A, is positive definite on V as follows. From remark (3) we see 
for A 6 V, with A(H) = Kill(// A , H ) for H eh, that 

<A, A> = Kill(/f A , Hj) 

using Proposition 13.9, and the dimension of g(a ) is 1. Thus <A, A) = 0 im¬ 
plies a (H x ) = 0 for all ad, which gives H x = 0 since & spans h *. Thus 
A = 0. 

Proposition 13.17 If a, p e and p , 4 are the largest nonnegative integers 
such that P + pae 0t and P — qxe then P — qx, P ~ {q — l)a,..., P + pci 
are all roots of g and 2 <a, /?>/<a, a> = q — p. 

Proof Continue the notation in Proposition 13.15 so that m = 
2<oc, /?>/<oe, a). For Z x eg(p + pa) and Z 2 eg(p - qx ), we have 

(ad JT.OZi = (m + 2 /^, (ad AQZ, = 0 , 

(ad H a ')Z 2 = (m - 2^)Z 2 , (ad Y)Z 2 = 0. 

Both Z t and Z 2 can be used to generate irreducible subspaces m l and m 2 
under the action of the algebra spanned by ad H a , ad X, and ad Y. Using 
irreducibility, these two subspaces must either coincide or have zero inter¬ 
sections. By checking the matrices of Theorem 13.11, we discover that the 
dimensions of m x and m 2 are m 4 - 2p + 1 and —(m — 2q) *f 1. The sum of 
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the two dimensions is 2 (p + q + 1 ), so the spaces coincide and are in fact 
equal to gift — qot) + gQi - (q - l)a) 4 - * * * + g(fi + pa). Thus all ft + kot are 
roots for -q<k <p (with k integral). Finally, since the dimension of m x is 
p + q + 1, we see p + q 4- 1 = m 4- 2p 4- 1 = 2q - m 4- 1 so that m = q — p 
as required. 

Exercises (2) Suppose r is the largest positive integer such that, for a 
given a g £%, we also have rot e 0t. Let 

k — g(~ct) 4- KH a ' 4- g( a) 4- g(2a) 4- • • • 4- g(rot). 

Show that k is a subalgebra of g which is invariant under ad H a \ ad X, and 
ad Y where X e g(ot ), Y e g{ - a). 

(3) By considering the trace of ad H a ' = [ad X , ad Y] restricted to k in 
exercise ( 2 ), show that 0 , oc, —a are the only integral multiples of a that are 
also roots of g. 

(4) Show that if a e r e K, and rot e then r = 0 or ± 1 . 

(5) Consider the Lie algebra g — &{<&) of example (2) of the previous | 
section. Letting ft = a 4 and a = oc 5 , verify directly the properties in the con¬ 
clusion of Proposition 13.17. 

( 6 ) Given g an arbitrary semisimple split Lie algebra as before, show 
that if a, ft, at 4- ft e St and 0 # X e g( a), 0 ^ Ye g{ft\ then [XY] ^ 0. 

Proposition 13.18 Consider a semisimple split Lie algebra g over a field 
K of characteristic 0 as before. If a lf a 2 ,..., a„ e ^ is a basis for h* over K 
and if V is the vector space over the rational numbers Q spanned by oc l9 ..., 
oc n , then & cz V. 

Proof Following the proof of Jacobson [1962], suppose ft = 
X”=i tiQtieffl. We must show that each t t eQ. Consider the system of 
equations 

2 <£, «,.>/<«,., 00 = £ i i 2 <a i , «;>/<«;, Xj} 

i= 1 

for j = 1, 2,..., n. This system of equations in t u t 2 .f„ has integral co¬ 

efficients and will have a rational solution if that solution is unique. However, 
the uniqueness follows from the linear independence of the a,’s over the 
field K. 

Remark (4) Given g as in Proposition 3.18 and a fixed a e 0t, define a 
nonsingular linear transformation S a : V —> V by 


S a (2) = X — 2«a, A>/<a, a»a 
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for all Ae V. Proposition 13.15 guarantees that S„(A)e V and at the same 
time shows that Stf) e * for all P* 9. Thus SJL*) = is, S. permute 

the finite number of elements of St. Notice also that SJx) a an 
(A e v\ S (A) = A> is a subspace of V of dimension one less that the dimension 
of V. These will be properties of 0t and V that will be very important in the 
next chapter. 

One can consider the group W of linear transformations on V generated 
by the S.’s as a varies over the elements of St. This group is finite because it 
can also be thought of as a set of permutations on the finite aet **• Moreover 
this group actually only depends on the Lie algebra g and is called the Wey 

group of g . 

Exercise (7) Find the Weyl groups of sl(2, K) and sl( 3, K). 





CHAPTER 14 

SIMPLE SPLIT LIE ALGEBRAS 


In this chapter we examine the important results on roots of a semisimple 
split Lie algebra given in Chapter 13. This leads to the study of abstract 
root systems by means of Dynkin diagrams, and we classify the irreducible 
root systems. Conversely, in the second section we construct some models of 
simple Lie algebras corresponding to the irreducible root systems. These 
algebras consist of four classical matrix types which we have previously 
considered in examples and five exceptional types. In the third section we 
discuss the inner automorphisms of these simple algebras in terms of sym¬ 
metries of the corresponding Dynkin diagram. 


1. Root Systems 


We now consider some of the previous results on roots and study abstract 
root systems. The Dynkin diagram of a root system is introduced, and by 
means of these diagrams we find the root systems which correspond to simple 
split Lie algebras. 

Definition 14.1 Let V be a finite-dimensional vector space over Q, the 
field of rational numbers, and let (xr, y) be a positive definite symmetric 
bilinear form on F 

(a) A finite subset 01 of nonzero vectors of F is called a root system in Fif: 

(i) 0t spans F, 

(ii) a e 01 and fa e 0t with t e Q, then t = ± 1, 
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(iii) a, /? 6 01 9 then 2(a, /?)/(a, a) is an integer; 

(iv) a, /? e ^2, then /? — 2[(a, /?)/(a, a)] aei 

The elements of ^ are called roots. 

(b) A root system is called irreducible if there is no proper subset Sf £ 0t 
such that (a, /?) = 0 for all a € 9* and e 0t but /? $ 

(c) For each ael define the a-symmetry of V, S a : V , by S a (x) = 

x - 2[(a, x)/(a, a)] a. 

The Weyl group W of the root system 01 is the group generated by the S a ’s 
as a varies over the roots of^. 


Example (1) Let g be a split semisimple Lie algebra over a field of 
characteristic 0,^2 is its nonzero roots, V as in Proposition 13.18, and (A, p) is 
the restriction of <A, p) to V. Various propositions of Chapter 13 show us 
that 01 is in fact a root system in V. 

Remarks (1) Many authors give a less restrictive definition of root 
systems and then prove properties (i)-(i v ) i° Definition 14.1(a). Our definition 
was chosen to correspond with properties already proved in Chapter 13. 
Also authors use different names for irreducible systems such as indecom¬ 
posable systems. 

(2) Certain properties proved for roots in Chapter 13 are obvious from 
Definition 14.1. Thus S a (^) =^for all ael; the elements of the Weyl group 
are completely determined by their action on $2; the Weyl group is finite; 
and if a e 0t 9 then —a = S a (a) e 


Proposition 14.2 If 0t c V is any root system, then there exist subspaces 

V t <= V with V= V t + V 2 H-+ as a vector space direct sum, and 

(x, y) = 0 if x e V;, y e V j9 i ± j. Furthermore & =® 1 u u32 m as a 

disjoint union where^ n V i is^n irreducible root system in Thus to 
determine all root systems, we need only find the irreducible ones. 

Proof If 01 <= Fis not irreducible by definition, M =0l x u ^ 2 , a disjoint 
and orthogonal union. Now set V { as the span of 01 i9 i = 1,2, so V x and F 2 
are orthogonal and V = V x + V 2 . Clearly 01 x is a root system in V t . Now repeat 
as often as necessary. 

Proposition 14.3 For any root system ^2 in V and any a e 0t 9 we have 
(S a (x),S a O0) = (x, y) for all x, y e V. 
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Proof From the definition we have 




, (*, y) - 4 y> + 4 (f ^)(«. y)(«..) _ (x y) 


(a, a) 


(a, a)(a, a) 


Proposition 14.4 For any root system 0 in V and a, p e 01, we have 
2(a, p)/(p, p) = 0, ±1, ±2, or +3. Define JV(ot, P) = 2(a, P)/(p, p). 


Proof By extending the base field of F to the real numbers R, we can 
consider 0t to be a subset of 2T, where m is the dimension of F and (x, y) 
on Fis just the restriction of the usual inner product on R m . If we let 6 denote 
the angle between at and p thought of as vectors in R m , we have 

cos 2 9 = (a, /?) 2 /(a, a)(p, P) 

so that N( a, P)N(P, a) = 4 cos 2 9 <4. Since N( a, /?) and AT(jg, a) are both 
integers, — 4 < iV(a, p) < 4. We must rule out the possibility that N(a, /8) = 
± 4. Assume N(oc, P) = 4. Then N(P, a.) =1 and cos 9 = 1. Thus (a, a) = 
2(a, P) = 4(P, P). However, then 0 = 0 and (a -2p,ot- ip) =0 so that a = 2P, 
a contradiction. Similarly if N(tx, P) = - 4, then 9 = n and a = - 2 p, again a 
contradiction. 


Examples (2) The following is a list of the possible root systems when 
F is two-dimensional (see Fig. 14.1). In each case we describe a V<= R 2 , 
draw a graph, and assign to the root system a certain “ type.” The notation 
for the types follow a scheme that will be explained later in this section; also 
see the work of Samelson, [1969, p.47]. 

(i) Type Ay x Ay, F= Q 2 , a = (1, 0), p = (0, 1), and 01 ={ + «, ±p}. 

(ii) Type A 2 , F= {(s, 3 1 / 2 r ): s, t e Q), a = (1, 0), p = (- 1/2, 3 1/2 /2), 
and^ = { + «, ±P,±(a + P)}. 

(iii) Type B 2 , F= Q 2 , a = (1,0), p = (-l, 1), and 0l = {± a, ±0, 
+ (« + P), ±(2a + ^)). 

(iv) Type <j 2 , F= {(^, 3 1/l2 ? le g}, a = (1,0), p = (—3/2, 3 1/2 /2), and 
^ = (±a, ±^, +(a + )?), ±(2a + 0), +(3a + )S), ±(3a + 2p)}. 

Notice that the root system of type Ay x A t is not irreducible but the other 
three root systems are. 

(3) Suppose we wish to compute the Weyl group of the above root 
system of type G 2 . The short (or long) roots are the vertices of a hexagon. 
It is clear that the Weyl group W must be a subgroup of the dihedral group 
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(iii) (iv) 

Fig. 14.1. Root systems, (i) Type A t x At . (ii) Type A z . (iii) Type B 2 . (iv) Type G 2 • 

of order 12 consisting of all rotations and reflections of this hexagon. We see 
S fi S a+fi {oc)^ S 0 ( 2oc + 0) = 2oc +/? and S^S a+p (2a -f £) = S/a) = a + so 
SpS a +p rotates the hexagon counterclockwise 60 degrees. Thus W is clearly 
the entire dihedral group of order 12. 

Exercises (1) Find the Weyl groups of the other three root systems in 
example^2) above. 

(2) Show that there are seven possibilities for the angle 0 between 
roots a and ^ ±a of a root system, namely 0 — 7r/6, tt/4, tt/3, tt/ 2, 27r/3, 
37i/4, 5 ji/ 6. Show that all of these angles occur in the graphs of the root 
systems of example (2). Show that if N(pc, fi) — ±2, then N(/?, a) = ±1,0 — 7i/4 
or 3 ti/ 4, and (a, a) = 2(p , /?). Also if N(a, fl) = ± 3, then a) = ± 1, 0 = tc/ 6 
or 5k/6, and (a, ae) — 3(/?, /?). Finally if JV(a, /?) = iV(j3, a) = ±1, then 0 = tt/ 3 
or 2 tt/ 3, and (a, a) = (/?, /?). 




288 


14. SIMPLE SPLIT LIE ALGEBRAS 


Proposition 14,5 If a and ft are two roots in a root system 01 with p ^ + a 
and (a, P) > 0, then a - P e 0t. 

Proof Since (a, f})> 0 implies JV(a, ft) > 0 and N(P, a) > 0, exercise (2) 
implies iV(a, p) = 1 orN(y?,a) = l.IfiV(a, P) = l,thenS/a) = a - A(a, p)p » 
a — P ei*.IfJV(j8, a) = l,then —S a (P) = — P + iV(/f, a)a = a — p recalling 
from remark (2) that ae^ implies -ael 

Proposition 14,6 If a, P e 01, p ^ ±a, and p and q are the largest integers 
such that P + pat 0t and p — qae 0, then 

(a) P -b fca e ^2 for all integers k with —q<,k<p\ 

(b) N(j8, a) = 2(a, £)/(«> a) = ? - p. 

Proof (a) The result is obvious unless p> 1 or #> 1. Assume that 
p > 1. Then — 3 ^ 1V(0, a) = 2(oc, j3)/(a, a) < 3 so that — §(ot, a) < (a, /?)< 
f (a, a), and for k > 2 we have (a, P 4- A;a) = (a, /?) + k(cc , a) > 0. By Proposition | 
14.5, (/? + k(x) - a = ^ + (4-l)ae^ if P + ka e 0t. Thus P , p + a, /? H- 
2tx,... ,p + pa e ^2. By considering —a instead of a, we find P~qa, p — 

(q - l)a,..., p e 01. 

(b) Since 

S a (p + koc) = p + *a - 2[(jS + fca, a)/(a, a)]a = /?-(/: + N(/?, a))oc, 

we must have S a (/? + pa) = P — qa and /? — (p 4- a))a = p — q&. Thus 
N(P,ol) — q — p as required. Notice that this is precisely the result in 
Proposition 13.17 proved for root systems of split semisimple Lie algebras. 

Definition 14.7 (a) A subset 01 <=^2 is called a root system basis for 
the root system^? in V if 0 is a vector space basis for V and, for any P e0t 
we have /? — £"= t m i a f where 01 = {a l9 a 2 ,...,a„}, and either all the m? s 
are nonnegative or they are all nonpositive. 

(b) A root system basis 0 <^0t is said to be irreducible if there is no 
nontrivial disjoint union 0 = ^uJ 2 with (a, j8) = 0 for all <xe0 t and 
P 6 ^ 2 * 

Proposition 14.8 Every root system possesses a root system basis. 

Proof Given the root system 01 in V, choose any vector space basis 
for V from 01, say {p u p 2i ..., /?„}. Any P e 01 can be written uniquely as 
P = B-i e 2- define a total ordering of by prescribing 

x > 0 for x e V if the first nonzero coefficient of x = Yj=i f tPi is positive, 
and for cl, p e 01 set a > p if a — p > 0. The usual properties for inequalities 
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now hold. Defined* = {a e 0t : a > 0} and^“ = { — a : e 01*} = {ae^: —a 
> 0}. Finally define 3 — {a e &* : for all ftye 0t* 9 a ^ p -F y}. The proper¬ 
ties of the next lemma show that ^ is a root system basis for^. 

Lemma 14.9 With the notation as above, we have the following. 

(a) spans V. 

(b) If a, p e @ and a ^ P, then (a, p) < 0. 

(c) 3 is a vector space basis of V. 

(d) If & s {a i9 ..., a*} and p e 01* 9 then either ft e 0$ or there is some 
0£j e 08 with p — oii e 01*. 

(e) If P e0$*, then there exist positive integers for i = 1,..., n with 
P = L?-i m i«i* 

Proof (a) It suffices to show that ^ spans . Suppose a e but 
is not in the span of 08, Then a = P + y where P, y e0t *. Now P > a, y > a, 
and either P or y is not in the span of 08. Thus by repeating this process we 
find there is no smallest element of 02* not in the span of a contradiction. 

(b) If oe, P e 08, cl ^ p, and (a, P) > 0, then by Proposition 14.5 either 
o£ — p s 02* or P — a e 0t*. However, then either a = (a — P) + P with P , 
a — P e 02* , or P = (/? — a) + a with a, p — cl e 02* , a contradiction. 

(c) We must show that the vectors of are linearly independent. 
Suppose they are not. Then we can write t i (x i = 0 where each a t e 08, 
and the a/s are ordered so that t t > 0 for each i < k, t x < 0 for i > k and some 
k'Zi 1 • Then 



= Z Z (—«j) ^ ° 

i=l j=k+ 1 

which is a contradiction. 

(d) Assume P is a root such that P g ^ + , P $ 0!, and p — a f for 
each a f e If for some a i we have P — a f e then a f = (a f — P) 4- P with 
(a* - /?), P e ^ + , a contradiction. Thus p — a, £ ^ for all a f e and by 
Proposition 14.5, (/?, a,) < 0. Using this fact and (b), we can follow the proof 
of part (c) to show that {/?, a l9 ..., a„} is linearly independent, a contradiction. 

(e) This follows easily by induction using (d). Thus if P $ 01, then there 
exists a f e 08 with P - 0 L t eM* . Since p - a* < P, we can assume by induction 
that P - otj = £ n k <x k with n k positive integers; this gives the results. 

Proposition 14.10 If 08 is a root system basis for a root system 0t in V 9 
then 01 is irreducible if and only if 0t is irreducible. 
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Proof Suppose 0 is not irreducible so ^ u with (a, P) = Ofor 
a g 0 X and P e 0 2 . Let 0 t be the roots in 0 spanned by 0 t , i — 1,2. Then we 
claim that = u 0 2 , for suppose ael but a $0 X and a £ ^ 2 . Then by 
Lemma 14.9(d) we can assume that aef with a = oc f + /?, a f e and p 
in the span of Now S a » = S at (a f + fl) = -a, + p e and this contra¬ 
dicts the definition of a root system basis. Conversely, suppose 0 =0 X u 
is not irreducible; then ^ = 0 X u where 0 { = n JL This shows that 
^ is not irreducible. 

Example (4) For each of the root systems in example (2), the notation 
has been chosen so that 0 = {a, /?} forms a root system basis. 

Definition 14.11 Given a basis 0 = {oc 1? ot 2 ,..., a n } for a root system^ 
in V, let N(cc, p) = 2(a, /?) for any a, p e 0 as in Proposition 14.4. 

(a) The matrix ( N(a t , a,)) is called the Cartan matrix of the root system^. 

(b) If oc = X"=i m ** a i e ^ + > then Xi-i m i is called the height of a 
(with respect to 0). 

(c) Two root systems 01 ,* in V { , / = 1, 2, are said to be isomorphic if and 
only if there exists a nonsingular linear transformation T from V x onto F 2 
such that T(0 X ) = 01 2 an< 3 (Tx, Ty > = c(x, j>)for all x, y e V x where 0 < c e Q 
and (x, y) is the bilinear form for V x and <x, y > is the form for K 2 . 

Example (5) The Cartan matrices of the root systems in example (2) are 


\2 0] r 2 -1] 

r 2 -li 


'2 —r 

Lo 2J’ [-1 2y 

1-2 2j’ 

and 

-3 2 


Also notice that the last three root systems have unique roots of maximal 
height. 

Remark (3) Notice that every Cartan matrix has 2’s down the diagonal 
and negative integers or 0’s elsewhere. Also notice that an isomorphism of 
root systems takes a basis of the first onto a basis of the second in such a way 
as to preserve the Cartan matrix. 

Proposition 14.12 (a) All the roots of a root system 0 can be determined 
from a basis 01 for0 and the Cartan matrix for^ with respect to 0. 

(b) Two root systems with bases such that their Cartan matrices are 
identical are isomorphic root systems. 

Proof (a) We will proceed by induction on the height of roots to 
find all roots of 0+. The roots of height 1 are just those in 0. Assume we 
know the roots of 0 + of height k and we wish to find the roots of height k + 1. 
By (d) of Lemma 14.9, every root of this height is of the form a + a f with 
of height k and a,, e 0. The Cartan matrix allows us to compute 
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N( a, a i). By Proposition 14.6(b), N( a, a i) = q — P where a + A:a,- e for 
-q<k<p.By the induction hypothesis q is known, so we are able to deter¬ 
mine p and decide whether a + a ( e or not. 

(b) This follows easily from (a). 

Remark (4) At this stage we have not shown a given root system has 
the same Cartan matrix up to a permutation of rows and columns no matter 
what basis is chosen. This is in fact true but will only become clear once we 
show that essentially different Cartan matrices correspond to root systems 
of nonisomorphic Lie algebras. 

Definition 14.13 The Dynkin diagram A of a root system ® in V with 
basis @ = {<*!,..., o„} consists of a graph in the real space R 2 with n vertices 
labeled with and JV(a,, o } )N(o s , «;) line segments joining the vertex 

of a f to the one a,. Finally if N(o, P)*0 and 0 3, P) > (a, a), draw an arrow 
on the line segments from the vertex of p to the vertex of a. 


Example (6) The accompanying graphs are Dynkin diagrams of the 
four root systems of example (2). 


o 

a 


o- 


a 


p * 




Type A t x A x Type A 2 Type B 2 


Type G 2 


Proposition 14.14 The Dynkin diagram A of a root system » in V with 
basis ^ ={a 1 ,...,a n } completely determines the corresponding Cartan 
matrix of M. 


Proof We must merely show that N(a f , a,-) can be determined for i ■£]. 
If vertices a f and a ; - are not joined, then N(a I , a i ) = 0. If they are joined 
by a single line, then 7V(a i? ccj) = N(cCj,ai) = -1 and (a f ,o0 - (ccj ,a 7 ). If 

N(ai> a iW a ^ a i) = then the r00ts cannot have the same length Wlth the 
longer being indicated by the arrow. Assume (ay,a,) > (a f ,a*). Then 

— N(a i , aj) = —2(0.1, Oj)/(«j, a j) < -2(a f , a j )l(u i , a,) = ~ a i)> so 
N(o i , a j) = -1 and N(Oj, a f ) — -2. Finally if N( a t , aj)N(oj, a,) — 3 and 
(aj , «j) > (otj, a ; ), then a similar argument yields N (a,-, a ; ) = -1 and N(ccj ,«,) 
= — 3. 


Remark (5) We have thus reduced the problem of finding all irreducible 

root systems to that of finding all Dynkin diagrams which (by Proposition 
14.10) must have each vertex joined to at least one other vertex; that is, the 
diagram is connected. 
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We shall show at the end of this section that there are only four infinite 
classes and five exceptional cases for Dynkin diagrams arising from irreducible 
root systems. These diagrams are listed in Proposition 14.15, and the notation 
used to label the diagram is historical with subscripts denoting the number 
of vertices and the letter the “ shape ” of the graph. 


Proposition 14.15 The list of Dynkin diagrams arising from irreducible 
root systems {a 1; a 2 ,..., aj are precisely the accompanying. 


Type A„, n> 1 
Type B„, n> 2 
Type C„, n> 3 



Type D n , n> 4 



°,«6 


Type E 6 


o-o-O' O' ' ■ «'. " 0 

oc x <x 2 cc 3 a 4 a 5 ’ 


oa 7 


Type Ej 


O O o -O-O-O 

“l «2 a 3 «4 «5 a 6 ’ 


°.«s 


Type E a 
Type F 4 


-° 4 = 




Type G 2 
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Example (7) Let oq = < -1, 0, 0, 0), oc 2 = (i, ±, i, £), a 3 = (0, -1,0,0), 
a 4 = (0, 0, — 1, 0), and let M be the system of roots 

±«i. /=1, 2, 3, 4; ±(a 2 + a*), /= 1,3,4; 

±( a i + a 2 + a 3 ), ± (oci + a 2 + a*), ±(a 2 + « 3 + a 4 ), 

±(«i + a 2 + « 3 + a 4 ), ±(a! + 2a 2 + a 3 + a 4 ). 

Then^ is an irreducible system of roots with @ = {a l5 a 2 , a 3 , a 4 } as a basis. 
The Cartan matrix and Dynkin diagram are 

' 2-1 0 0 " 

-1 2-1 -1 
0-120 
0-1 0 2 _ 

and 



The Dynkin diagram is of type Z> 4 . 

We now prove Proposition 14.15 in many steps. First we shall find the 
diagrams without the arrows and put them in later; we shall still call these 
diagrams without arrows “ Dynkin diagrams.” Thus if 98 — {<x u ..., a„} is a 
root system basis, we replace the a* by the unit vector X t = a f /||a f || where 
||a f || is the length of a f determined by the form (x, y) on F considered as in R n ; 
note Definition 14. L Consequently we study the set {X u ... ,X n } satisfying 

(X it X t ) = 1, (X i9 Xj) < 0, 4 (X i9 Xj) 2 = 0, 1, 2, 3 (*) 

for i # j and /, j = 1,..., n. These conditions come from Lemma 14.9 and 
Proposition 14.4 using = ||aJ|X| so that N(oc iy otj)N(cc j9 oc { ) — 4(X t ., Xj) 2 . 

The corresponding Dynkin diagram, still denoted by A, consists of the 
points X u ..., X„ in R n as vertices with the number of lines joining X t and Xj 
given by : X t and Xj are connected by 4(X f , Xj) 2 = 0, 1, 2, or 3 line seg¬ 
ments. Since we want to find the irreducible root systems, we see from 
remark (5) that the corresponding diagrams must be connected; that is, for 

points 17, V in A there is a sequence U l =U, U 2 > _, U k = V in A so that 

Ui and U i+l are connected in the diagram {note Proposition 14.6 and 
exercise (1), Section 14.2]. 

We now determine the connected diagrams using the following steps. 

(1) Let A be a Dynkin diagram corresponding to the vectors X i9 ..., X n . 
Let A' be the graph obtained by omitting a number of points and the lines 
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joining these points. Then A' is a Dynkin subdiagram corresponding to the 
remaining vectors X it ,X ik . 

This follows from the definition of a Dynkin diagram. For example, 

o-o-o-o-O-O SO 


becomes 


O-O O-0 555! Q 


Using conditions (*) we 

also have the following. 

(2) A correspondence between 

diagrams and values of inner products 

is given by 



o— 

-o 

Xj’ 

(X t , Xj) = -1/2, 

xT 

— O 

Xj’ 

(X t , Xj) = — 2 1/2 /2, 

o— 

X t 

- 0 

Xj' 

C X „ Xj) = — 3 1/2 /2. 


(3) There are no closed polygons. 


Proof Suppose X 1 ,...,X k are the vertices of a polygon where X i is 
connected to X i + 1 for 1 < i < k — 1 and X k is connected to X x . Then set 
X = £ Xj , and use (2) to compute 

(X, X)=Z(X i ,Xj)< 0. 

Thus (X, X) = 0 so that £ Xj = 0. This contradicts the linear independence 
of X U ...,X„ over Q. 

(4) There are at most three lines coming from a vertex. 

Proof Let X be a vertex with Y 1 ,...,Y k connected to X. Since there 
are no closed polygons, no two Y t are connected. Thus (Y t , Yj) = 0 for i # j. 
In the vector space spanned by X, Y,,..., Y k , we can choose a vector y 0 so 
that (Y 0 , 7 0 ) = 1 and (y 0 , Y f ) = 0 for / = 1, We also have (X, y 0 ) #0, 
otherwise X is dependent on Y 0 , Y 1 ,...,Y k which implies X is dependent on 
Y u ...,Y k , a contradiction to the choice of vertices in a diagram. Since 
X = X (X, Yj)Y t is an orthogonal representation of the unit vector X, we have 


1 = (X, X) = (X, y 0 ) 2 + (X, Y 1 ) 2 + --- + (. X, Y k ) 2 
>(X,Y 1 ) 2 + ---+(X,Y k ) 2 . 




1. ROOT SYSTEMS 


295 


However, 4(X, Y,) 2 is the number of lines joining X and Y t so that 4> 

i 4(X, Y f ) 2 which is the total number of lines coming from X. 

(5) The only connected Dynkin diagram in which three lines join two 
vertices is of type G 2 . 

Proof If this were not the case, there would be more than three lines 
coming from one vertex, contradicting (4). 

We now consider the remaining cases where one or two lines join two 
vertices. Using (4), we note that the accompanying graphs are not Dynkin 
diagrams: 



This can be be generalized as follows. 

(6) Let X 1 ,X 2 ,...,X f ,X i+ 1 ,...,X i+ *,...,X n be a set of vectors 
satisfying condition (*) and let A be the corresponding Dynkin diagram. 
Suppose the vertices XX i+l ,..., X i+k are such that X i+P is connected 
to X i+p+] by a single line for p = 0,..., k — 1 and let X = Y_q = o X i+(r Then 

the vectors X,. X,. u X, X i+k+1 ,..., X„ satisfy condition (*) and the 

corresponding Dynkin diagram A' is that of the original diagram except all 
the X.. X i+k have been replaced by the vector X. 

Proof Since X i+P is connected to X i+p+ i by a single line, we have 
2(X i+p , X i+J>+1 ) = — 1. Since there are no closed polygons, (X i+P , X i+ ,) = 0 
for p < q unless q=p + \. Thus we can compute 

(X,X)= t (X f+p ,x l+q ) 

p,q~ o 

= k + 2 X (X i+p ,X i+? ) 

p<q 

= * + (* — 1)=1 

to obtain X as a unit vector. Now let Y be a vector in A with Y ¥=X i+p . Then 
since there are no closed polygons, Y is connected to at most one X j+r for some 
r. Thus (Y,X i+p ) = 0 if p ± r which implies (X, Y) = £ (X i+P ,Y) = 
(X j+r , Y). Therefore 4(X, Y) 2 = 4(X i+r , Y) 2 = 0,1, 2, 3; that is, conditions 
(*) are satisfied for X„..., X t . u X, X i+k+l ,..., X„ and has Dynkin diagram 
as described. 
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Remark (6) In the following examples, the diagram A' is obtained from 
A by shrinking the vectors X iy ..., X i+k (together with the single line joining 
them) to a single point X, If the accompanying ^graphs were Dynkin diagrams 
A, then they can be shrunk to Dynkin diagrams A' of the form following (5). 



However, these small graphs are not Dynkin diagrams so the large graphs 
are not Dynkin diagrams. 

(7) The only connected Dynkin diagrams are the accompanying ones. 



Proof If a connected Dynkin diagram A has three lines joining two 
vertices, then A is of type G % . If A has two lines joining two vertices, then, 
from remark (6), A is of the second type above. Finally we have the case 
where any two given vertices have one line joining them. Thus if there are 
two endpoints, we obtain a diagram of type A n . If there are three endpoints, 
we obtain the third diagram. By remark (6) there cannot be four or more 
endpoints. 

(8) If a connected Dynkin diagram has three endpoints, then it is one 
of the accompanying. 
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Proof The third diagram in (7) is the only possibility, and one of the 
branches must contain only one vertex (besides the center vertex). If this is 
not the case, we use step (1) to obtain the accompanying subdiagram where 



Yi Y 2 Y 3 Y e Y 7 


{Y t ,... 9 Y 7 }c{X l9 ...,X n }.LetX= Y 1 + 2Y 2 + 3Y 3 + 2Y 4 + Y s + 2Y 6 + Y 79 
and using step (2) we compute ( X , X) < 0. Thus X — 0 which contradicts 
the linear independence of Y t ,... ,y 7 . Next a subdiagram of the form shown 
in the accompanying figure is impossible. Otherwise, let 


Y t Y 2 Y, Y 4 Y 6 Yj Y a 
x = y 1 + 2y 2 + 3 y 3 + 4 y 4 + 2Y S + 3 y 6 + 2Y 7 + y 8 
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and we obtain (X, X) < 0 which contradicts the independence of Y u .., i Y 8 . 
Thus only two vertices can occur at one end. 

Finally the form of the accompanying subdiagram is impossible. Other- 


y] Y 2 Y 3 Yt Y s Y 6 Y h Yg 

wise, X = Y x + 2T 2 + 3 F 3 + 47 4 + 5Y S + 6Y 6 + 3 Y 7 + 4Y 8 + 2Y 9 yields 
(X, X) < 0 and a contradiction. Thus the possibilities where one branch has 
one vertex (the endpoint) and the other branches have two or more vertices 
are of type E 6 , E 7 , E s . When two branches have only one vertex (which must 
be an endpoint) we obtain a diagram of type D n . 

(9) The Dynkin diagrams which contain two vertices joined by two 
lines must be one of the accompanying. 


B n > Cn 9 


o- o = === o-o 2*4 

Proof From step (7) we must have the accompanying diagram where 


Y x Y 2 Y° p - t Y p Z q Z^ t Z 2 Z x 

the Y's and Z’s are just a relabeling of the original X’s. Let Y — £f = i iY t 
and Z = Then using 2(Y t , Y i+1 )~ —1 = 2(Z fc ,Z jt + 1 ) and the 

fact other obvious vertices are not joined, we compute 

(Y, Y) - £ (iY„jYj) 

= t i 2 + 2 I ij(Y t , Yj) 

i — 1 i<j 

- fp-'lW+l) 

i= 1 i~ l 

= P 2 -Pip - D/2 = p(p + l)/2 

and 


(Z, Z) = q(q + l)/2. 
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Next using certain vertices are not connected and step (2), we compute 
(Y, Z)=Z ‘MY, Z k ) 

i,k 

= pq{Y p ,Z q ) = -pq2 l ' 2 l2. 

From Schwarz’s inequality (and the independence of Y and Z), 

pV/ 2 = (y,Z) 2 < \\Y\\ 2 \\Z || 2 = [p(p+ \)/2]f[q(q + 1)/2J, 

and since pq> 0 we obtain 2 pq < (p + 1 )(q 4- 1) ; that is, (p — l)(q - 1) < 2. 
Because p and q are positive integers, the only possibilities are p = 1 with q 
arbitrary, q — 1 with p arbitrary, and p = q = 2. This gives the desired 
diagrams. 

We now put the arrows on the diagrams replacing the unit vectors X t 
by the = Ua.-HX,- for / = 1, in the root system basis Since multiply¬ 
ing every a ( . by a nonzero real number does not change the situation, we assume 
one of the oc ( is a unit vector which we choose as an endpoint. Also note that a* 
and <Xj are connected by the same number of lines as X { and Xj. Thus if 
a, P e & are connected by one line 

N( a, a) = 1, 

and since N( a, /?) and N(ft, a) are negative integers, we have 

- 1 = N( a, P) = 2(a, MP, P) = N(p , a) - 2(ft a)/(a, a) 

which implies (a, a) = — 2(a, /?) — (j?, p). Thus there are no arrows on the 
Dynkin diagrams of type A n , , E 6 , E 7 , E 8 . 

If a, P e $ are connected by two lines, then N( a, P)N(P , a) = 2 which gives 
two possibilities : N(a,/?) = — 1, N(P,a) — —2 or N(oc,P) = ~2 y N(P,a) = —1 
so that (/?, jg) = 2(a, a) or (a, a) = 2(/?, P). Thus depending upon the choice 
of endpoint as a unit vector and using the results above for vertices joined 
by a single line, we obtain diagrams of type B a , C n , and F 4 (since F A is a 
symmetric graph, the choice of unit vector is immaterial). Similarly we obtain 
the diagram of type G 2 ; this completes the proof of Proposition 14.15. 

We have followed Jacobson [1962] and Samelson [1969] for the above 
proof and other variations are by Bourbaki [1968, Chap. 6] and Hausner and 
Schwartz [1968]. A method for choosing the vector X with (X,X) < 0 is 
given by Samelson [1969]. However with a little practice, it is easy to see how 
to make a guess for the desired coefficients; see exercise (6). 

Exercises (3) Find a root system in R 3 of type A 3 , that is, whose Dynkin 
diagram is of type A 3 . 
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(4) If09 is any root system and a e ^, set a' = 2a/(a, a). Show that S x . = S„ 
and that N( a', /}') = N(fl, a). Let 09* = {a' : a e 09} and show that 01* is a root 
system called the dual system of 01. If 0$ is a basis for 09, show that 09* = 
{a': a e 09} is a basis for ^2*. Finally show that if 0 is of type B„, then 09* is 
of type C„ and conversely. 

(5) Complete the determination of the arrows in the diagram of type G 2 ■ 

(6) (i) Show directly, without the above classification, that a diagram 
of type G 2 is not contained in any larger connected diagram. [Hint : Assume a 
diagram 

o .... ■ == o-o 

with three vertices X lt X 2 ,X 3 . Then consider X = rX t + sX 2 +tX 3 and 
use step (2) to find r, s, t so that (X, X) < 0.] 

(ii) As above show directly that the accompanying diagrams are impos¬ 
sible. 



2. Classification of Split Simple Lie Algebras 

In this section we will construct split simple Lie algebras corresponding 
to the Dynkin diagrams of type A n , B„, C„, D„, G 2 , and F 4 . Let g denote a 
split semisimple Lie algebra over a field K of characteristic 0, h a Cartan 
subalgebra, 09 the set of nonzero roots with A its Dynkin diagram, and we 
continue the previous notation. 

Proposition 14.16 The algebra g is simple if and only H09 is an irreducible 
root system. 

Proof Let a, p e 09. If (a, /?) = Kill(H„, H p ) = 0 then [H x Y] = p(H x ) Y = 
(a, P)Y= 0 for Ye g(P). Suppose 09 is not irreducible so 09 —09 x u 09 2 with 
(a, P) = 0 if a e 09 u p e 09 2 . Set h { as the span of all H x with nel, and g i = 

+ #i 9( a )’ * ~ 1. 2. From above it is clear that [/;, g 2 ] - 0 and \h 2 g { ] = 0. 
Now consider X e g( a), Y e g(P) with a e 09 1 and P e 09 2 . Since (a, a + P) = 
(a, a) * 0 and (P, <* + /?) = (p, P) # 0, we have a + P $ 09 so [XY] = 0 and 
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[9i 9 i\ — 0- Thus g — ^ © g 2 , a direct sum of ideals. The argument is revers¬ 
ible if g is not simple. 

Exercise (1) Show g is simple if and only if the Dynkin diagram A 
(corresponding to^ and is connected. 

Proposition 14.17 Let g be a split semisimple Lie algebra with nonzero 
root system and let & be a root system basis for 0L. For each tx t e set 

“ H’ ai — 2H a J(ct i9 a*). 

Choose any 0^ X t €#(«*) and then let Y { eg{ — a f ) be the unique element 
such that [X f Y,-] = Now let 

where n is the dimension of h (and which equals the number of roots in &). 
Then 

(a) s/ generates g ; 

(b) $4 is a part of a basis for g such that all the basis elements not in #0 
can be determined from A as well as all of the structure constants relative 
to this basis; 

(c) The structure constants of the basis above are all rational numbers 
and any such basis for g is called a Weyl basis. 

Proof We will show all of the properties of the proposition simultaneous¬ 
ly by showing by induction on the height of roots that we can step-by-step 
add basis vectors from higher root spaces which continue to satisfy the desired 
properties. 

First a basis for root spaces of height 1 is contained in stf . We can compute 
explicitly 

[HiHj] = 0 , 

[H i X j ] = N(<x j ,<x i )Xj, 

[Hi Yj] = —N(<Xj, cti)Yj, 

[X t Y { ] = 

[Xi Yj] = 0 if «*/ 

Also [XiXj] and [Y f Y y ] are in higher root spaces. Here the root space #(-a) 
will always be considered at the same time as g(oc). Notice that the nontrivial 
structure constants are entries of the Cartan matrix and cai; be determined 
from the Dynkin diagram A. 

Next assume for all a e with a of height k or less that X a e g(a) and 
Y a eg{ — a) have been defined so that for any product of these elements 
which also lies in such a root space, all of the properties of the proposition 
hold. For any /? e of height k 4- 1, we can write /? = a f + a with a f e 
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and a of height k using Lemma 14.9. Define = [X^X^] and Y fi = 3^], 

Suppose in a similar manner y ~ a ; 4-<5. We can now list some products which 
shows that the products at this height also satisfy the required properties. 
Thus using the Jacobi identity, we obtain 

[X, Y f ] = [XAXM YJ]] + tix.tr, r.]]xj 

= [x d [[x a Yjrj] + [x.rax, yjfl + n(«, a^x. yj 
+ [[i , i tx i rj]xj. 

There is a similar formula for 

tx y ^] = [tx > xj[r,rj] ) 

[XitXjXJ] = [X y [XjX,]] + [[XjX y ]X a ], 
tH i X y ] = N(y,a i )X y . 

This completes the proof. 

The following is an easy consequence of the preceding result: 

Corollary 14.18 For any split simple Lie algebra g over a field K of 
characteristic 0, there exists a subset g 0 c: g such that g 0 is a split simple Lie 
algebra over Q. The root system of g 0 is the same as that of g , and g £ 
“ 9o ®qK' 

Theorem 14.19 Given a field K of characteristic 0, then: 

(a) there exist split simple Lie algebras which have root systems deter¬ 
mined by each of the Dynkin diagrams listed in Proposition 14.15; 

(b) two such algebras are isomorphic if and only if they produce the 
same Dynkin diagram. 

(c) the algebras are described in Table 14.1 where we use the notation 
s/(n, K) = si(n 4- 1, K), @(n, K ), #(«, K ), @(n, K ) for the algebras defined in 
Chapter 13. 

We separate the discussion of Theorem 14.19 into several parts and give 

TABLE 14.1 


Root system 

Model 

Rank 

Dimension 

A a 

s/(n, K) 

n > 1 

(h -f 2)n 

Bn 

K ) 

n > 2 

(2 n 4* l)n 

c n 

r <S{n, K) 

n> 3 

{In 4* 1)« 

D n 

2>(n, K) 

n > 4 

(2 n — 1 )n 

E 6 


6 

78 

E n 


7 

133 

E s 


8 

248 

F * 

®(A/ 3 8 ) 

4 

52 

<*2 

&(<&) 

2 

14 
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some examples. First we assume part (a) and now consider (b). Let g x and g 2 
be two split semisimple Lie algebras and suppose : g x -► g 2 is an isomor¬ 
phism. Then Cartan subalgebras, weight spaces, etc., of g x map onto a 
corresponding item in g 2 . Also if Kill^ , ) denotes the Killing form in g i9 
then Ki\\ 2 (4>X, <j>Y) = Kill^X, Y ). To see this we just note that from 4>[XY] — 
[<j)X(l)Y] that ad cj)X = <j) ad Now take the trace according to the 

definition of the Killing form. The Dynkin diagrams are determined by the 
number of roots (which is the same for g x and g 2 ) and the numbers N^oc, /?) 
for i = 1, 2 using Proposition 14.14. These numbers are in turn determined 
by the Killing forms which are equal as above. Thus the Dynkin diagrams 
are the same up to possible relabeling. 

Conversely, suppose g x and g 2 have the same Dynkin diagram. Then 
Proposition 14.17 applied to the corresponding sets of generators s4 x and s4 2 
show that we obtain the same structure constants for a basis determined by 
s/ x and $4 2 . Choosing such a basis, we see that the linear map which assigns 
the corresponding basis elements is an isomorphism; also note the proof of 
Samelson [1969, p. 52] for a similar argument. 

Example (1) When part (a) is proved, we shall see that g — sl( 3, K ) 
has a root system of type A 2 . To illustrate the details of constructing g 
solely from the Dynkin diagram, we apply Proposition 14.17. First we have 
the accompanying Dynkin diagram and the Cartan matrix 


o o 

<*2 



where N(a u a 2 ) = N(a 2 , a,) = — 1. Also N(ix 1 , a 2 ) = q —p where a, + kcc 2 e 01 
for —q<k<p. However q — 0 so p = 1 and a 1 + a 2 e We can assume 
(oti>«i) = (a 2 ,a 2 ) = 1 so (a,,a 2 ) = — i, and we can compute (a, + a 2 , + a 2 ) 

= 1, (atj + a 2 ,a,) = (a, + a 2 ,a 2 ) = This gives 

JV(ai + a 2 ,a,) = lV(a, + a 2 , a 2 ) =1. 

Thus q = 1 and p = 0 in these cases, so 2a, + a 2 4 a, + 2a 2 4 0t. Let 
{X u H u Y u X 2 , H 2 , Y 2 } be as in the proof of Proposition 14.17. For 
ai + a 2 , the root of height 2, set X 3 = [X t X 2 ], Y 3 = [7, y 2 ], We can 
compute 

[ x 3 y 3 ] = [xjpr.yjyj] + [xjyj^y,]]] + N(a 2>ai )[x 2 y 2 ] 

+ [[n[^i^]]^ 2 ] 

= [X^HJ] - H 2 
= -H 1 -H 2 , 

[y 2 x 3 ] = [[y^urj + [A-jfy.x,]] = x 2 , 

[x,y 3 ] = y 2 . 
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Now let g be the Lie algebra spanned by X l9 X 2 , X 3 , H u H 2 , Y u Y 2 , Y 3 
with multiplication determined as in Proposition 14.17 and listed in the 



X x 

X 2 

x 3 

Hx 

H 2 

Yx 

y 2 

y 3 

X x 

0 

X 3 

0 

- 2 Xx 

Xx 

Hx 

0 

y 2 

x 2 


0 

0 

x 2 

—1X 2 

0 

H 2 

-Yx 

x 3 



0 

x$ 

X 3 

-x 2 

Xx 

£ 

1 

£ 

I 

H x 




0 

0 

-2 Yx 

Y 2 

-y 3 

h 2 





0 

Yx 

-2 Y 2 

-y 3 

Y t 



(*) 



0 

y 3 

0 

y 2 







0 

0 

Y 3 








0 


accompanying tabulation, where (*) is determined by anticommutivity. Now 
g is a split Lie algebra of type A 2 . 

Remark (1) Once we know that the Lie algebras of Theorem 14.19 
exist, Proposition 14.17 could be used to construct an algebra of the proper 
type from the Dynkin diagram. The above example shows that if the Dynkin 
diagram is very large, there will be many computations involved. 

We now consider parts (a) and (c) by giving models of algebras corres¬ 
ponding to a given Dynkin diagram. We do this for types A y B, C, D , G 2 , and 
F 4 . Most of the computations have already been done in the examples in 
Section 13.2. Since the Dynkin diagrams will be connected, these Lie algebra 
models are actually simple algebras. 

Type A n Let# = sl(n + 1, K) with n > 1 as in example (1), Section 13.2. 
Recall that it was determined that the roots of g were the set of all ot jk : /z -► K 
such that ctjkCZHi a n Eu) = a n - a kk where a n = 0. More precisely 

& = {<*jk -j ^ 1 </> k < n + 1} 

with a tJ + a jk = <x ik and a Jk = -a kJ . Set a, = a,*, a 2 = a 23 .a„ = a„„ +1 , 

and let $ = {oc 1? a 2 , ..., aj. We claim that & determines a Dynkin diagram 
of type A n . Thus if i <j, then <x jk = a ;y+1 + <x J+lj+2 + ■■■ +« k - lk , and 
since oc Jk = — cc kJ , it is clear that & is a root system basis for 

It was also shown that 

H ajk = (Ejj - E kk )/2(n + 1) 
so 

<®y > «*m> = Kill((£ if - Ejj)l2(n + 1), (E kk - E mm )/2(n + 1)) 

= [l/2(n + 1)] trace [{E n - E n )(E kk - E mm )]. 
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Thus by computing this trace, we see (u Jk , <x Jk } = l/(n + 1) and 
a/+K+ 2 > = -1/2(n + 1). Finally <a„ +1 , % +1 > = 0 fory > / + 1. 

To construct the Dynkin diagram, we join the n vertices a t ,... , a„ by 
various lines: <x t and ctj are joined by N(a ( , oij)N(iXj, a,) line segments where 
N(oc, p) = 2<a, /?>/</?, f}}. From the above computations, we see the adjacent 
roots ctj and a i+1 are joined by N(a t , a 1+1 )N(a, +lJ a f ) = 1 line segments, 
while nonadjacent roots a, and aj are joined by N( a,, <Xj)N(tXj, a,) = 0 line seg¬ 
ments. If a f and a } are joined by a single line, then N(a t , ctj) - N(ctj , a,) = -1 
and <a f , a ( > = <a y , a,>. Thus there are no arrows. Consequently the Dynkin 
diagram for sl(n + 1, K) is 

O-O ■ — ■ - o * * * o 

a l <*2 <*3 a « 

and so sl(n + 1, K) is a split simple Lie algebra of type A „. A notation which 
would agree with the other notation in Chapter 13 is to define K ) = 

s!(n + 1, K\ 

To obtain the dimension given in Theorem 14.19, just use the dimension 
of sl{n + 1, K) as (n + l) 2 — 1. Also note Section 2.3 for arguments concerning 
the other dimensions. 

Type B n Let K ) with n > 2 be the Lie algebra given in exercise (1), 

Section 13.2. Thus K ) consists of all (2 n + 1) x {In -f 1) matrices over 

K such that J n A t J n — —A where 

"10 • • • 0 " 

J _ 0 O n I n . 

.6 /„ 

that is, ${n, K) consists of all skew-symmetric matrices relative to the non¬ 
degenerate symmetric bilinear form determined by J n on a 2n + 1-dimensional 
vector space over K. 

From the table of roots for @{n, K), let oc x — cc lt 2 , a 2 = a 2 , 3 ,..., 
a n _ x = a n _ lt „, and a n = and let ^ = {a l5 ...,a n }. Then a straightforward 
computation shows J is a root system basis. For example, let H = 
E<'iti,iti(4!,itr^t,ti,i+.+i) be in the Cartan subalgebra A. Then 
for p < q 

^p, <|C^0 ^p +1» p + 1 +1, +1 

= ^p+l.p + l — ^p + 2,p+2 + tfp+2,p+2~ #p+3,p + 3 + ' * * 

4“ ^q,q a q + t, q+1 
— ( a p + ' * * + a 4-l)(£0- 

Similarly P Pin = a p 4- * * * + + 2a„, etc. 
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Instead of using direct computations as in the /I,,-type, we shall use 
Proposition 14.6 to determine the 5,-type Dynkin diagram. First let H eh 
and « lt ..., a„he as in the preceding paragraph. Then we compute the numbers 

N(ct if ccj) 

by computing the corresponding numbers p and q according to Proposition 
14.6. Thus consider a f and a i+1 for i = 1— 2. We shall show 

«i +1 - a| and a i+1 + 2a ( * 

are not roots while 

a f and a i+1 +a £ 

are roots. Consequently ? = 0, p = 1, and iV(a f , ct i+l ) = — 1. Thus for H eh 
as above we see 

0*i+i a /)C^0 = a i+i, “ a i, i+i(H) 

— 2<2j + 2.i + 2 — ^i + 3,i+3 + 

Comparing this value with the possible values of roots given in exercise (1), 
Section 13.2, we see a i+1 — a* is not a root. Similarly 

( a i+i + 2a i)(H) = 0 *j+i,i +2 + i+i)(^0 

— 2di + I f i + 1 — +2,1+2 + 3, i + 3 

which also shows a i+1 -f 2a f is not a root. Next cc i+l 4- a £ = a t% i+2 is a root 
because 

(«/+1 4- ai)(H) = a £+1> f +2 (H) 4- a f> i+1 (H) 

— a i + 1, i + 1 ~ a i + 3, i + 3 ~ a i, i + 

Thus for the root a f and a l+1) we see N(a f , a i+1 ) = — 1 as desired. Similarly 
we can show N(a I + lJ af) = — 1 so that in the Dynkin diagram the vertices 
a f and <x i+i are joined by N(a f , a i+1 )AT(a t+1 , a*) = 1 line segment. 

Next for a f and olj with j ^ i + 1 ^ n and j ^ rc, we see 

(a t - — = 0/ + 1, j + i — #i + 2,i + 2 — a j + l,j + l + a j+2, j+2 

which shows a t . - a y is not a root. Similarly a, 4- a y is not a root so that p = 
? = 0 and A r (a i ,a J ) = 0. Thus in this case the a f and a,- are joined by 
N{a t , <Xj)N((Xj , a f ) = 0 line segments. 

Consider a„_ x and a„. A computation shows <x n ^ l — a„ is not a root, but 

OX n~X 2a„)(i/) = ®n, n ^n + l,n + l 

so that a n+1 4 2a„ = /? n + lj „ is a root. Thus? = 0 ,/> = 2, andA^.^, a n ) = -2. 
A similar computation shows N(oc n ,oc n _ l ) ~ — 1 so that oc n _ 1 and a„ are joined 
by N(oi n „ u a„)iV(a„, = 2 line segments. 
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The roots a l9 ..., cc n ^ 1 are all the same length, since 
N(a i9 cc i+1 ) = N(oc i+1 , a £ ) = -1 
implies [using Proposition 14.6(b)] 

(a i+1 , a i+1 ) = -2(a f , a i+1 ) = (a„ a f ). 

Next 

-1 = N(a„, a n _i) = 2(a„_ 1 , aJ/Ca^j, a*^) 

and 

-2 = N(a fl _!, a„) = 2(a„, cc n _ x )/(<x n > a„) 

implies (a n _ l5 a„-i) = 2(a„, a„) so that a„_t is longer than a„. This gives an 
arrow from a n „ x to a„ and the accompanying Dynkin diagram. 

^2 a 3 a n-2 a /i-l a n 

The same process can be used for the C n -type and Z)„-type which we now 
sketch and leave the computations to the reader. 

Type C n Let #(n, K) with n > 3 be the Lie algebra of exercise (3), 
Section 13.2. From the table of roots, we set 

a l = a l, 2 > a 2 — a 2, 3 j • • • 5 a n -1 ~ a n- 1, n > a « = . 

Then ^ = {a 1 ,..., aj is a root system basis. The roots <x u ..., a n _ t are the 
same length, but (a n ,a„) > a,,^). Thus we obtain the accompanying 

Dynkin diagram. 



Type D„ Let K ) with «>4 be the Lie algebra of exercise (2), 

Section 13.2. Then the root system of Q){n, K ) can be considered as consisting 
of precisely the roots j , j , and y ir j listed as roots of 8&(n, K). Thus if we 
let 

a l = a i, 2 5 * • • s a n-l = a n-l, »> a « = Pn-1, n > 

then ^ = {a l5 ..., a„} is a root system basis. The roots a t ,.. ., a n _ 2 are con¬ 
nected by a single line segment. Also the roots oc„_ 2 and a,,^ are connected 
by a single line segment and so are the roots a„_ 2 and a„. Thus we obtain 
the accompanying Dynkin diagram. 


<*«-1 
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Exercises (2) Complete the computations to determine the Dynkin 
diagram for the C„- and Z>„-types. 

(3) In the statement of Theorem 14.19, we make restrictions on the rank 
to obtain nonisomorphic algebras. When these restrictions are removed, 
show the following isomorphism between types; we denote these isomor¬ 
phisms symbolically by 

(i) C u 

(ii) B 2 £ C 2 , 

(iii) A 3 £ Z) 3 , 

(iv) D 2 — A y -h A t ; that is, a member of type D 2 is a direct sum of 
two of A t (see the book by Freudenthal and De Vries [1969] for details). 

Next we consider some exceptional simple Lie algebras. 

Type G 2 Consider the derivation algebra of a split Cayley algebra 
as described in example (2), Section 13.2. Thus let g = = { D(A , x,y ): 

A e si (3, X) and x, ye X 3 }, and let h = {D(H, 0, 0) : H e sl( 3, X), H diagonal}. 
It was shown that^£ = {± a f : / = 1, 2,..., 6}, where the a/s can be described 
by setting H = £ f =1 a n E u esl( 2, X) so a n 4- a 22 + a 33 = 0 and listing 
the a f (Z)(if, 0, 0)) in the accompanying tabulation. Set a = a 5 and fi = a t . 
Then = {a, /?} is a root system basis for since a + P = oc 4 , 2a + p = — a 6 , 
3a + /? = a 3 , 3a + 2/? = a 2 , and ^ = {±a, +/?, ±(a +/?), ±(2a + /?), 
±(3a + j8), ±(3a + 2/?)}. 


i 

0 , 0 )) 

1 

#11 — #22 

2 

# 11 ” #33 

3 

#22 — #33 

4 

#11 

5 

#22 

6 

#33 


Now from this listing of roots, we see /?, P + a, P + 2a, P + 3a are roots, 
but P — oc is not a root. Therefore, from Proposition 14.6, weseeN(/?, a) = —3. 
Similarly, since a and a + /? are roots while a — ft is not a root, we see 
JV(oe, P) = -1. Thus a and P are joined by N(a, P)N(P, a) = 3 line segments. 
From 


~3=N(p, a) = 2(a,/0/(a, a) 

we see 3(a, a) = -2(a, /?). Similarly -1 = W(a, £) implies (/?, P) = -2(a, /?). 
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Thus (/?, /?) = 3(oc, a) so that ft is longer than a. This gives the accompanying 
Dynkin diagram so that &(<£) is a split simple Lie algebra of type G 2 ■ 



From Section 9.6 we note that ^ is a simple nonassociative algebra with 
an identity element. Thus from exercise (3),, Section 12.5, every derivation 
of ^ is inner. More explicitly, if P is the subspace of End(^) spanned by all 
L(X ) and R(Y) for X , Ye then the Lie transformation algebra = 

P + [P , P] ; see Section 7.2. Now the identities for # yield for any X, Y e ^that 

D(X, Y) = [L(X\ L(Y) ] + [L(X\ R(Y)] + [R(X) 9 R(Y)) 

is an inner derivation of and consequently any derivation D of ^ equals 
Z>(^T;, Yi) for suitable 

Type We again use derivations of a suitable nonassociative algebra 
to construct a Lie algebra of type F A . First recall from example (3), Section 
9.6 that the split Cayley algebra # given by 2 x 2 matrices has an involution 
X -► X given by 

f“ ;l-f * -i 

v b\ [ — v a] 

Now let M 3 8 denote the 3x3 Hermitian matrices with entries from # and 
with this involution. Thus M 3 8 is the set of all matrices 

X T 
X a 2 Z 
Y Z a 3 

where a, e K and X, Y, Z e If S, Te M 3 S , then the product 
S* T=i(ST+ TS) eM 3 8 

where ST and TS is the usual matrix product. Thus M 3 8 is a Jordan algebra; 
that is, a nonassociative algebra which satisfies the identities 

S T= T• S and (S 2 • T) • S = S 2 * (T- S). 

Let J — M 3 8 and let g — @(J). Then from the identities we see the Lie 
transformation algebra j£f(7) = P + [P, P] where P is the subspace spanned 
by all L{X) for X e J. Since J is a simple nonassociative algebra with an 
identity element 1, every derivation D of J is inner. Thus if D = 
L(Z) + X [UXi\ L(y f )], then 0 = Z>(1) = Z so that Z> = £ [L(2Q, L(Yd] for 
suitable X i ,Y i e J, We now sketch the proof that g is of type F 4 [Jacobson, 
1971a, b, p. 407; Schafer, 1966]. 
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Let E tJ denote the usual matrix basis and E t = E u for i, j = 1, 2, 3. Then 
from the above matrix description of J, we set 

Aq — AEjj + AEji 

for A e Thus A i} £ J and 

J iJ = {A iJ :Ae^} 

is a subspace such that J u = Jji , and we have the direct sum 

J = KE X 4- KE 2 4- KE 3 + /12 4“ J 13 + 1 / 23 - 

Let D 0 ~ {D eg: DE t = 0 for i = 1 , 2, 3}. Then D e D 0 implies £>J 0 <= J u 
using a few matrix computations involving E u * S. Thus D induces an endo¬ 
morphism of Ju . This, in turn, induces an endomorphism D tj of ^ by the 
formula DA u = (Dy A) u for D e A e and A i} e J u as above. From this 
we obtain the following result [Jacobson, 1971a; Schafer, 1966}. The Lie 
algebra L of all endomorphisms of the Cayley algebra # which are skew- 
symmetric relative to the bilinear form induced by the norm in ^ is of type D A . 
Furthermore the map D 0 ^>L: D-+ D 4J is an isomorphism of D Q onto L. 
Using these results, we can decompose 

g = D 0 4" 0i2 4- #13 4- #23 (*) 

where ^ 12 = {[UE X \ UA l2 )]: A e <€\ = and 

g 23 = {[L(E 2 ), L{A 23 )\z A e&}. Now choose a four-dimensional Cartan 
subalgebra h for D 0 as discussed in type D n and use e k to express the roots 
to obtain for i ^ j 24 roots <x if j — e f — Sj , P it j = e f 4* tj , j = — £ f - £ ; for 
1 ^ Uj < 4. Thus since the dimension of the g i3 is 8 , we obtain the dimension 
of g is 52. 

In the above decomposition (*) for g , ad^ h acts diagonally on g xl9 g X3 , 
and g 23 and for i = 1, 2, 3, 4 has roots ±£ f ; ±A f where A i = i( e i 4- £ 2 4- £ 3 
4- £4) - £i ; and ±M 4 where M x = i(s x 4- £ 2 4- £ 3 4- £ 4 ), M 2 = K fi -i + e 2 “ e 3 
- 64), M 3 = K £ i - £2 + £3 - £ 4 )> 4/4 = - £ 2 - £ 3 4- £4). From this h is 

a Cartan subalgebra for g . 

Next let <x x = i( e i — £2 “ s 3 ” £ 4 ^ a 2 — e 4 , cc 3 = £ 3 - £ 4 , a 4 - £ 2 - £ 3 . 
Then ^ = {ai, a 2 , a 3 , a 4 } is a root system basis. To obtain the Dynkin 
diagram, we use the usual process to compute the N(a f , a J) noting we can add 
a 2 twice to <x 3 and still have a root; this gives the accompanying diagram. 



Exercise (4) Let A denote either of the algebras or M 3 S given by 
the above matrices, and let A 0 denote the subspace of matrices of trace 0. 
Show that g = @(A) acts irreducibly on A 0 . 
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For an abstract detailed construction of algebras of type E, we refer to 
Hausner and Schwartz [1968] and for construction relating derivations and 
nonassociative algebras, details can be found in Jacobson [1971a, b] and 
Schafer [1966], 


3. On Automorphisms of Simple Complex Lie Algebras 


In this section we restrict ourselves to simple Lie algebras over the com¬ 
plex numbers and sketch results on automorphisms of their algebras. The 
algebras must automatically be split, so each must be of one of the types 

»•. *, G 2 , etc. 

If a closer study of the relationship of Cartan subalgebras and the corre¬ 
sponding root systems to automorphisms of split simple Lie algebras is made, 
then we find that the Dynkin diagram is also related to automorphisms of 
the algebra. We state without proof an important theorem about this rela¬ 
tionship; this theorem is part of Theorem 33.9 of Freudenthal and de Vries 
[1969] and another variation is given by Jacobson [1962, Chap. 9 ], 

Theorem 14.20 Let g be a simple complex Lie algebra, let Aut(^) and 
Int( 0 ) be the complex Lie groups of all automorphisms of g and the subgroup 
of inner automorphisms, let 39 be a root system basis for a system of roots 0t 
of g, and let A be the corresponding Dynkin diagram. Let Aut(A) denote 
the group of automorphisms of A. More precisely Aut(A) is the group of the 
permutations of & = {a t ,..., a„} which preserve lengths of the #,’s and angles 
between pairs of them (this information is easily read from A). Then the group 
Aut(#)/Int( 6 r) is isomorphic to Aut(A). 

Proposition 14.21 Table 14.2 lists Aut(#)/Int(<?) for all simple Lie algebras 
g over the complex numbers. 


TABLE 14.2 


Type of g 

Aut(g)/lnt(g) 

A n n> 2 

2i 

B„ n> 2 

{ 1 } 

C n n> 3 

{ 1 } 

D„ n >5 

Zi 

Ai, G 2 , F*, £ 7 , E 8 

{ 1 } 

d 4 

S 3 

E 6 

Zi 
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Proof The only Dynkin diagrams having one symmetry are those of 
types A„, D„, E 6 . The nonidentity automorphism of A in these cases are given 
by arrows shown in the accompanying diagrams. The only Dynkin diagram 




with more than one symmetry is D 4 , and in this case the outside roots 
can be permuted in any manner. Thus two possibilities are illustrated in the 
accompanying diagrams. 



Proposition 14.22 Let g = sl(n,C) with n> 2 and <P e Aut(gf). Then 
there is some U e SL(n, C ) such that 

<i o{X ) = UXU' 1 for all Xeg, 

or 

<p(X) = - UX’U' 1 for all Xeg. 

Proof It is easy to check that both definitions give automorphisms of g. 
If Te g, then (exp ad Y)(X) = (exp Y)X(exp Y ) _1 where exp Ye SL(n, C). 
Thus if <pe lnt(g), then <p can be written as an automorphism of the first 
type above. Since, by Proposition 14.21, Aut( 0 )/Int( 0 ) has order 1 if n = 2 
and has order 2 otherwise, we need only show that X -*■ — X‘ is not an 
inner automorphism if n > 3. Equivalently we need the fact that there is no 
U eSL (n, C) such that UX = -X'U for all Xesl(n, C). This is easy to 
show for n ^ 3 by choosing in succession the usual basis for g to substitute in 
for X and show that certain entries of U are 0 until one obtains a contra¬ 
diction. For n- 2 we have the following result. 

Exercise (1) Find a matrix UeSL(2.,C ) such that UXU = X 
for all Xesl(2,C). 
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Proposition 14.23 Let ^ be a Cayley algebra over the complex numbers, 
let g = be its derivation algebra, and let Aut (tf) be the automorphism 
group of %>. If <p e Aut (g), then there exists some Te Autftf) with (o(D) = 
TDT~ l for all Deg. 

Proof Since g is of type G 2 , we have Aut(^) = Int(^). If E e 
the exp E e Aut('tf) and exp(ad E) e lat(g). However, (exp(ad E)){D) — 
(exp E)D(ex p £) _1 , and since the maps exp(ad E) generate \ni{g), we obtain 
the results. 

Exercises (2) F,ovc a result analogous to Proposition 14.23 for the 
algebra M 3 a . 

(3) Each of the simple Lie algebras ®(n, C), #(n, C), ®(n, C) can be 
described as a set of matrices AT satisfying - -X for some fixed J. 

Consider the corresponding groups B(n, C), C(n, C), D(n , C) as the set of 
nonsingular matrices U such that JU’J -1 = U~ 1 . 

(i) Show that any inner automorphism of the three Lie algebras above 
can be written as X —* UXU 1 with U in the corresponding group. 

(ii) Show that for g of type B„ or C„ that Aut(tf) = \nl(g ); note Propo¬ 
sition 14.21. 

(iii) For 9 of type D n with n > 4, show Aut(^) has an automorphism 
which is not inner [Consider the map X ->• PXP' 1 where P is given by the 
interchange map 

C^i,..., X 2n ) —» (X lt ..., X„_j, X n + l , X „, X n+2 ,..., X 2n ).] 

What can be said about X -» — X* ? 

(iv) For n > 4 and g of type D n , show Aut(#)/Int(#) is of order 2. 
What is the general form of an element in Aut(^) ? Type D A is more complicat¬ 
ed, and we refer to the work of Hausner and Schwartz [1968] and Jacobson 
[1971a]. 



CHAPTER 15 


SIMPLE REAL LIE ALGEBRAS 
AND GROUPS 


In Section 12.6 we showed that a finite-dimensional simple Lie algebra over 
the real numbers is either isomorphic to the realification of a simple complex 
Lie algebra or is isomorphic to a real form of a simple complex Lie algebra. 
We have classified the simple complex Lie algebra and have shown that a 
realification of a simple complex Lie algebra is a simple real Lie algebra. 
Thus it suffices to describe the real forms to obtain a classification of the 
simple Lie algebras over the real numbers, and we do this in Section 15.1. 
Next we consider the irreducible representations of simple real and complex 
Lie algebras discussing the maximal weight and how they are determined, 
the basic representations, and how an irreducible representation is obtained 
from these. We also discuss Weyl’s formula for the dimension of an irre¬ 
ducible representation and give examples showing how the Dynkin diagram 
determines an irreducible representation. Finally in Section 15.3 we discuss 
how the previous results apply to Lie groups. 


1. Real Forms of Simple Complex Lie Algebra 


In this section we first discuss generalities about real forms of a simple 
complex Lie algebra g by noting they are given in terms of a conjugation 
operator C :g -+g. These conjugations can be written in the form C = <£C_ 
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where C_ is a conjugation of a compact real form and <f> is a suitable 
automorphism ofg with^ 2 = I. In the second part of this section, we discuss 
the real forms for the classical Lie algebras by explicitly computing the 
automorphisms This discussion follows closely the general approach of 
Hausner and Schwartz [1968]. 

Definition 15.1 Let g be a simple complex Lie algebra. 

(a) If C is a conjugation of g , then the fixed point set g — {X e g : C(X ) = 
X} is called the real form of C and C is called the conjugation of g. 

(b) Two conjugations C x and C 2 of g are called equivalent conjugations 
if the real forms of C x and C 2 are isomorphic. 

Thus from results in Section 9.2 we have for the real form g the decom¬ 
position g = g 4* ig and relative to this decomposition C(X + iY) ~ X — iY 
for X, Y eg. Also since the isomorphism of real forms defines an equivalence 
relation, it induces an equivalence relation on conjugations, and we seek a 
suitable representative for each equivalence class. 

Proposition 15.2 Two conjugations C x and C 2 of a simple complex Lie 
algebra g are equivalent if and only if there exists an automorphism (p ofg 
such that C 2 <p = <pC x . 

Proof Let^ x and g 2 be the real forms of C x and C 2 . If is an auto¬ 
morphism of g with C 2 q> = <pC u then for all X eg x we have C 2 {q>(X)) « 
<p(Ci(X)) — cp(X) so (p(X) eg 2 and cp restricted to g x is an isomorphism onto 
9i * 

Conversely if Q and C 2 are equivalent so that^ and g 2 are isomorphic, 
choose a particular isomorphism ij/: -> g 2 . Any element of g can be 

written uniquely as X + iY with X , Y eg x , so we can define <p:g-*g by 
<p(X + iY) » For all X, Y eg x 

C 2 (cp(X + iY)) = C 2 fy(X) + iHY)) 

« MX) - iMY) = <p(X - iY) = <p(C x (X + iY)) 

so C 2 <p — q>C x as required. It is easy to check that^> is an automorphism of g. 

Proposition 15.3 Let g be a simple complex Lie algebra. 

<a) If C x and C 2 are conjugations of g, then C X C 2 is an automorphism 
of g . 

(b) If C 0 is any fixed conjugation of g and C is any other conjugation, 
then C = (pC 0 where <p is an automorphism of g such that C 0 (pC 0 = 
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(c) If C 0 is a fixed conjugation of g and C t ~ <P\C 0 > C 2 = <p 2 Q) are 
two conjugations as above, then C x and C 2 are equivalent if and only if 
there exists an automorphism q> of g such that <p 2 = <P<P\Cq <p" 1 C 0 . 

Proof (a) This consists of some straightforward computations which 
will be left to the reader. 

(b) Since C 0 2 — I we note C = (CC 0 )C 0 , so C = <pC 0 where cp ~ CC 0 
is an automorphism of g. Now / = C 2 = cpC 0 (pC 0 so C 0 <pC 0 = (p* 1 . 

(c) By Proposition 15.2, C A and C 2 are equivalent if and only if there 
exists an automorphism (p with (p 2 C 0 (p = C 2 (p = q>C l = <P<P\C Q or 
92 - (p(p i C 0 <p~ 1 C 0 . 

Remarks (1) Given a real form g of a simple complex Lie algebra g , 
the Killing form of g is equal to the Killing form of g restricted to g y since a 
basis for g over the real numbers is also a basis for g over the complex num¬ 
bers This fact will be helpful in computing the Killing forms of real forms. 

(2) The Killing form of a real form g is a nondegenerate symmetric 
bilinear form over the real numbers. Consequently by Sylvester’s theorem, 
there exists an orthogonal basis for g such that Kill(2f is X ( ) = ±1 for each 
basis element X t , and the signature, or the number of 4* 1 ’s minus the number 
of — l’s, is independent of the basis chosen. The signature of the Killing form 
will be referred to as the signature of#. Notice that isomorphic real forms have 
equal signature; that is, more importantly, if two real forms have different 
signatures, then they are not isomorphic. 

Definition 15.4 A real form g of a simple complex Lie algebra is called a 
compact real form if the Killing form of g is negative definite; that is, if the 
signature of g is — m where m is the dimension of g. 

Proposition 15.5 Any simple complex Lie algebra possesses at least two 
nonisomorphic real forms, a compact real form and a split real form. The 
signature of a split real form g is equal to the rank of g; that is, equal to the 
dimension of any Cartan subalgebra of g. 

Proof Given a simple complex Lie algebra #, choose a Cartan subalgebra 
h of g and a basis for the root system of g. By Proposition 14.17, g 
possesses a Weyl basis; that is, a basis such that the Lie product of any two 
basis elements is equal to a linear combination of the basis elements with 
rational coefficients. Let g be the set of linear combinations of these basis 
elements with real coefficients. It is clear that g is a split real form of g ; 
that is, a split real Lie algebra and a real form of g. 

To compute the signature of g we construct an orthogonal basis for g 
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from the Weyl basis of g. Clearly one can choose an orthogonal basis for 
h = g nh, and the Killing form restricted to h is positive definite (note 
Proposition 13.16). Thus this part of the basis for g will contribute +n to the 
signature of g where n equals the dimension of h. For each positive root a e 01 
there exist X a eg( a) and Y, eg(-tx) so that [X a Y,] = H,' eh with [H a ’X„] = 
2X a and [H a 'Y„] — -2Y a . However, then 

« = 2 KiII(*„ Y a ) = Kill ([H a 'X t ], Y a ) 

= Kill (H tt \ [X a YJ) = Kill(tf a ', HJ) > 0, 

so we can set 

U* = (X. + Y,)la 112 and V a = (X x - Y a )/a 1/2 . 

Consequently KiIl(C/ a , U x ) = 2Kill(* a , Y x )/a = 1, KilI(K a , V x ) = - 1, 
Kill(C/ a> Up) = Kill(C/ a , H) = Y0\\(V x , H) = 0 for all Heh and positive 
roots <x, pe<%, and finally Kill(C/ a> U fi ) = Kill(F a , V fi ) = 0 for all positive 
roots cc p.It is clear that the set of all £/„’s and F a ’s complete an orthogonal 
basis lor g and contribute nothing to the signature of g. Thus the signature of g 
is n as required. 

We can alter the above basis slightly to obtain a basis for a compact real 
form of g. Thus consider an orthogonal basis for g as follows: iH k for 
k = 1,..., n with H k e h and iU„, V x as a varies over all positive roots of 01. 
Let g t denote the set of all linear combinations of these basis elements with 
real coefficients. Clearly the Killing form of g restricted to g 1 is negative 
definite, so we must merely show that g 1 is a subalgebra of g or, what is 
equivalent, that g 1 is the fixed point set of a conjugation of g. We see g x is the 
real form of the conjugation <pC where C is the conjugation of the split real 
form g, and q> is an automorphism of g such that (p(H) = -H for all H eh, 
and for any positive root a, <p(X x ) = -Y a , <p(Y a ) = -X a . It should be 
clear that 2f a ’s and Y/s can be chosen so that they satisfy the properties of the 
previous paragraph and also so that q> defines an automorphism of g; note 
Helgason [1962, p. 155] for a slightly different proof. 

Example (1) Let g = sl(n + 1, C), so it is of type A„. We have pre¬ 
viously shown that, for X, Y eg, Kill(X, Y) = 2(n + 1) trace(-XT), and a 
Weyl basis is given by setting H^ k+1 = E k k - E k+Uk+l for k = 1, 2, .... n 
and for 1 < j < k < n + 1, X— E Jk , and Y jk = E kJ . To find an orthogonal 
basis for the split real form, we first choose H u H 2 , ..., H„ eh where h is 
the set of real diagonal matrices of trace 0 such that Kill(i7*, H k ) = 1 and 
Kill(/f; ,H k ) = 0 if j # k. Also U Jk = (.E Jk + E kJ )/2(n + 1) 1/2 and V Jk = 
(Ejk ~ E kj )/2(n + 1) 1/2 for 1 < j < k < n + 1 and so clearly all these elements 
span the split real form g — sl(n + 1, R). The corresponding compact real 
form with basis iH k , k = 1, 2, .. ., n and iU Jk , V Jk , 1 <j < k < n + 1 clearly 
is the algebra su(n + 1 ) = {X eg : X~‘ = - X}. 
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Similarly in Table 15.1, g is a simple classical Lie algebra over C of the 
type listed with n the rank of g and the compact form of g up to isomorph¬ 
ism (note Proposition 15.6(b) and the book by Chevalley [1946] for a com¬ 
plete discussion of the compact spin )). The notation is given in Section 12.6, 
but we shall change it slightly later. 

TABLE 15.1 



9 

9- 

dimension g- 

A n 

(n> 1) 

su(n +1) 

n(n+ 2) 

B n 

(«> 2) 

5o(2w +1) 

n(2n -fl) 

c n 

(«>3) 

sp{n) 

n(2n + 1) 

D n 

(»> 4) 

so(2n) 

n(2n — 1) 


Proposition 15.6 Let g be a simple complex Lie algebra, and let g- be a 
compact real form with conjugation C_ . 

(a) If C is any conjugation of g , then C = <pC _ where 4> e Aut(g), 
<t> 2 = I, and C- <fr = (j)C- . 

(b) If K- is a conjugation of g corresponding to any other compact form 
of g, then C_ and AL are equivalent. 

Proof (a) From Proposition 15.3, C = where C_ 0 2 C_ = <£i , 
so all we have to do is show there is a conjugation (f>C _ equivalent to C with 
C_ <j>C- = <j>. Moreover we can assume that <f> has the form <t> =■ 1 C_ 

for some t// s Aut(g) which we give below. 

For any Zj = X x + iT„ Z 2 = X 2 + iY 2 eg=g~+ig. with X { , Y t eg _ 
we compute 

Kill(C_(Z 1 ), C_(Z 2 )) = KilKZi - IY U X 2 - iY 2 ) = KilKZj, Z 2 ) (*) 

where a is the conjugate of a in the complex numbers. We define the form on 
9 by 

(Z 2 , Z 2 ) = —Kill(Zj, C_(Z 2 )) 

and see it is a positive definite Hermitian inner product on g as follows. The 
form is linear in the first variable, additive in the second, and satisfies 
~(Z U Z 2 ) = Kill(Z 1 , C_(Z 2 )) = Kill(C_(C_(Zi)), C_(Z 2 » = Kill(C_(Z 2 ), 
C_(C_(Z 1 ))) = Kill(Z 2 , C_(Z 1 ))= — (Z 2 , ZO, using (*). Also (Z u ZJ = 
—Kill(Jf! + iY u X 2 - iY t ) = -Kill^!, Z 2 ) - KiU(y 1( Ti) > 0 so that the 
form is positive definite. 
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For an endomorphism A of g we define the adjoint A* by (AZ, W) = 
(Z, A*W) and note that if Kill(^Z, AW) = Kill(Z, W), then 

(AZ, W) = -KiJl(^Z, C~(W)) 

= — Kili(Z, A~ l C.(W)) = (Z, C.A-'C-QV)) 

so that A* = C-A~ X C- . Consequently A* = A if and only if A = C_ 
if and only if A~ x = C_ ylC_ . In particular, this gives <p x = <p x *; that is, (f> x 
is a Hermitian linear transformation so has real characteristic roots and is 
diagonalizable by a unitary transformation. If we regard (j) l as a complex 
matrix where the adjoint matrix is the transpose of its complex conjugate, 
then we can write <j> x where D is ^ real diagonal matrix and 

P* = p* = p~*. Now set if/ — PDF' 1 where D is the diagonal matrix with 
entries equal to the positive square root of the absolute values of the cor¬ 
responding diagonal entries of Z)' 1 . Note by construction that = \p. 

Using this we see 2 = Z, and C_ = \j/ = ij/*; 

that ^ is just a matrix computation using C" 

Finally ij/ is an automorphism as follows. Let i ls ..., be the characteristic 
roots of <p which give the matrix D, and let g x1? ..., be the characteristic 
spaces. Let X e g Xi and 7 e g Xj . Then [ XY] = Z e g XiX j if X h Xj is a character¬ 
istic root, otherwise [XY] — 0. Now the spaces g ki are the same characteristic 
spaces which give the matrix D for \j/ 9 and if [XY] = Z as above, then 

[mY] = \x l r ll2 \i J r l/2 z 

= \x,Xj \- i/2 z = ^z = ^n 

which yields ij/ e Aut(#). 

Now let <fi = \j/(j) l C-il/~ 1 C„ = \f/ 2 (pi. Then C_ </>C_ = </> _1 = <f> since 
(j> 2 = L Thus, by Proposition 15.3(c), 4> X C- is equivalent to <£C_ with $ 
having the required properties. Note from the construction of <j> = 
we have (j>* ~ 4> relative to the form (Z l9 Z 2 ), and therefore is diagonal- 
izable. 

(b) Let K _ be a conjugation corresponding to another compact real 
form /*_ of g. Then up to equivalence, we can write — <j)C„ where 
(j) 2 = l 9 (j)C_ — C_<£, and <£* = $ e Aut(^) as above. We also have from these 
that <p = C_K-=K.C-. Let <Z l5 Z 2 > = -Kill(Z l5 K.(Z 2 )) for Zj eg = 

-h ih„ . Then as for formula (*) in part (a), 

Kill(^-(ZO, AT_(Z 2 )) « KilKZ^ Z 2 ) = KilKC^ZO, C.(Z 2 )) (**) 

and also <Z l5 Z 2 > is a positive definite Hermitian form. 

Next <p is positive definite, for if 4>X = XX with X < 0, then using (**) 

o > x(x, xy = <</>X, X> 

= -Kill (K.(C.(X)\ K-(X)) 

= -Kill(C_(C_(X)), C_(X)) = (X, X) 
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which contradicts (Z l9 Z 2 ) is positive definite. Since (j) is diagonalizable with 
real positive characteristic roots and <j> 2 = I we see </>=/. Thus, since we have 
written K_ = <f>C_ up to equivalence, we see K_ is equivalent to C_ . 

Remark (3) We use the preceding results to classify the real forms of a 
simple complex Lie algebra g as follows. The real forms are determined by 
conjugations with isomorphic forms giving equivalent conjugations. Thus 
we must choose a suitable representative of the equivalence class which gives 
isomorphic real forms. From Propositions 15.5 and 15.6 we can choose a 
conjugation C_ corresponding to a compact real form, and it does not make 
any difference (up to equivalence) what compact form is chosen. However, 
any other conjugation C can be written C = <£C_ with the automorphism <j> 
given according to Proposition 15.6(a). Thus it suffices to discuss such auto¬ 
morphisms which we do for those algebras whose automorphism groups were 
computed in Chapter 14. 

Exercises (1) Let g__ be a compact real form of a semisimple complex 
Lie algebra g. Let S e Aut (g) be such that S 2 = /, and let k = {X e g : SX = X} 
and p = {X eg : SX = — X }. Show that the direct sum k -h ip is a real form of 
g and every real form of g is obtained this way up to an automorphism of g. 
Also note the book by Helgason [1962, pp. 152-159]. 

(2) Let g be a simple complex Lie algebra, and let h be a maximal 
compact subalgebra of g [note Section 12.6, remark (2)]. Show that h is a 
real form of g . 

Proposition 15.7 Let g = sl(n -f 1, C) for «>2 be the complex Lie 
algebra of type A n . Then C_ : X -> —X f is a conjugation of a compact real 
form. Any given conjugation of g is equivalent to precisely one of the follow¬ 
ing conjugations: 

(a) C. ; 

(b) C + :X^X; _ 

(c) C k : X -► —T k X f T k for k — 1,2,...,[(« + l)/2] where T k is a diagonal 
matrix with the first n + 1 — k entries down the diagonal equal to 1 and the 
last k equal to — 1. 

(d) C 0 : X TXT \f n + 1 is even, where Tis a matrix with all entries 0 

except for a series of the matrices [° f 0 *] down its diagonal. 

Proof Since the real form of C_ is compact [example (1)], every con¬ 
jugation of g is equivalent to some <pC_ where <p is an automorphism of g, 
<p 2 = /, and C_ <pC_ = <p. 
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From Proposition 14.22, the automorphisms of g are given by 

oW-.X-^AXA- 1 and v(A) : X-> -AX'A' 1 

where A e SL(n + 1, C). One can compute that a{A)o(B) = a(AB), a(A)z(B) — 
t (A3), t(A)<t(B) = r(A((B f )-% t(A)z(B) = a(A((Br% <r(A)~ l = o{A~% 
t (A) 1 = t{A 1 ), and g(A) = <r(B) or t(A) = t(B) if and only if B is a scalar 
multiple of A . 

We first show that any conjugation <r(A)C _ is equivalent to one of the 
conjugations in (a) or (c). We may assume o(A) 2 = a(A 2 ) = / and C_ a(,4)C_ 
= <j((A t )~ 1 ) = o(A) noting the proof of Theorem 15.6(a). Thus A 2 — al 
and AA* = bl for some complex numbers a and b . Since det A = 1 we have 
a ” +1 = b n+i = 1. By replacing A with cA where c is a square root of a 9 we 
may assume A 2 — I and AA* = bl. Now (n + 1 )b = trace(>4A*) > 0 so b = 1 
and A 1 = A' 1 = A. Thus A is a Hermitian matrix. By possibly replacing A 
with — A, we may .assume that A has 4* 1 and — 1 as its only eigenvalues with 
at least as many -f- l’s as — l’s. Moreover since A is Hermitian, it is diagonal- 
izable by a unitary matrix, so we can choose a matrix B with B* — B~ x 
and BAB' 1 equal to / or one of the matrices T k of (c). Finally 

a{B)o{A)C^a{By 1 C, = <j(BAB*) = <j{BAB _1 ), 

and so o(A)C„ is equivalent to o{BAB~ 1 )C_ which must be one of the con¬ 
jugations in (a) or (c). 

Next consider the conjugations of the type t(A)C-. Assuming / = 
t(A) 2 = cr(^(^ t ) ~ 1 )and C„t(^4)C_ = t^A')” 1 ) = z(A), we find that/404') -1 = 
al and AA* = bl. As in the first case we find that b — 1 so ( A*)~ 1 — A 
and AA — al = AA. Now al — AA — AA = al so a is real and by taking 
determinants a n+1 = 1 recalling A e SL(n + 1, C). There are two possibilities 
either a = 1 or if n + 1 is even we could have a = — 1. 

So consider t(/ 4)C_ with A A =/ and A* = A~ 1 . We claim each such 
conjugation is equivalent to t(/)C_ : X -> X, the conjugation of (b). Since 

o(B)z{A)C- <t(j8) -1 C_ = TiBAB' 1 ), 

it suffices [by Proposition 15.3(c)] to show that there exists a Be SL(n + 1, C) 
with 

BAB' 1 = I 

or equivalently B~ 1 B = A. Since A is unitary its eigenvalues have absolute 
value 1 and it is diagonalizable. Thus we need only define the action of 
B on eigenvectors of A in order to determine B completely. If X is an eigen¬ 
vector of A with corresponding eigenvalue a, then define BX = (IX where 
P 2 = a; note this gives pj 5=1. Since 

AX = A~ x X — oT x X = <xX, 
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X is also an eigenvector of A with eigenvalue a, and we can assume that we 
have BX = pX. Then 

B~ x BX = B~ x pX = p~ x pX = p 2 X = a X = AX 
and B~ 1 B — A as required. 

The last case to be considered is r(A)C„ with AA — —I, A* = A' X , 
and n + 1 is even. We shall proceed precisely as above in order to show that 
each of these conjugations is equivalent to t(T)CL where T is as in (d). 
We need to define B on eigenvectors of A in such a way that BAB' 1 = T. 
If AX - ocX , then AX — — A~ x X = -a ~ x X = ~aX, so eigenvectors of A 
occur in pairs X 9 X with corresponding eigenvalues the negatives of each other. 
If AX = aX define B x X = PX where p 2 = a. Then clearly we can assume 
B X X = PX so that B-^X = BJ x X = P~ x X = PX, B.AB^X = p 2 ocX = X 9 
and B x AB x x X — —p 2 (x,X = —X. Using the fact that A' = B X AB X X has half 
of its eigenvalues equal to -f 1 and the other half equal to — 1, A f has the 
same Jordan canonical form as T and consequently B 2 A t B 2 % = Tfor some 
B 2 . Next from the form of T notice that iT is a real matrix. Also for eigen¬ 
vectors X , X as above we see A'X — X and A'X = — X implies A'X =X 
and A'X = —X which gives A' = —A'. Therefore iA' = iA' == ( — i)(—A') — 
iA’ so that iA' is also a real matrix. This gives for the real matrix iT = 
B 2 (iA')B 2 1 . Thus the real matrices /Tand iA! can be assumed to be similar by 
a real matrix B 2 since they are similar by a complex matrix. We can conclude 
T =B 2 A'B 2 1 with B 2 a real matrix. However, then B 2 X = B 2 X and for 
B = B 2 B X we see 

BAB' 1 = (B 2 B 1 )A(B^B~ i y 1 = B 2 A'B 2 X = T 

as required. 

The fact that none of the conjugations listed are equivalent follows from 
the fact that their real forms are not isomorphic, and in fact all have distinct 
signatures. The signatures are listed in the following corollary, and showing 
that the list is correct is left as an exercise. 

Corollary 15.8 Any real form of g = sl(n + 1, C) of type A n , n> 2 
must be isomorphic to precisely one of the algebras shown in Table 15.2. 

Exercises (3) Use remark (2) of this section to verify that the correct 
signatures are given in the above tabulation. Also show that none of the 
signatures can coincide for a fixed n . 

(4) Show that the 2x2 matrices X of complex numbers such that 
T 0 XT 0 = X where T 0 = [^]form an algebra over the real number which 
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TABLE 15.2 


Name 


Signature 


(a) su(n + 1) = {X g g : X 1 = — X) 

(b) sl{n+hK)=:{Xeg : X=X) 

(c) sf(n, k,R) = {Xeg:-T k X'T k = X} 
for k= 1,2,..., [(« 4-1)/2] and 

T k as in Proposition 15.7. 

(d) If n 4 - 1 is even, sl(n -F 1/2, &) is the set of 
(n -f I)/2 x (n -f l)/2 matrices with 
entries from .2, the real quaternion 
numbers, such that the trace of the 
matrices is a quaternion number of 
trace 0. 


—/i 2 — In 
n 

1 - (w + 1 - 2k) 2 


—n — 2 


is isomorphic to the quaternion numbers. Also show that sl((n -F l)/2,is 
isomorphic to g 0 = {X eg : TXT = X} where n + 1 is even, g = sl(n + 1, C), 
and T is as in Proportion 15.7(d). 

(5) Show that any real form of g = sl( 2, C ) of type A t must be iso¬ 
morphic to sl(2, R ) or su( 2). 

Proposition 15.9 Let g be a simple complex Lie algebra of type G 2 . 
Then there are only two isomorphism classes of real forms of g, the split 
real forms and the compact real forms. 

Proof We may assume g = .@(^), the derivation algebra of # the 
complex Cayley algebra as described in example (2) of Section 9.6. Let 
X: -»denote the map which sends an element [“ “] into [£ jj] where 

these elements were defined in the example mentioned above. Clearly X 2 = /, 
X(aX) = aX(X) where a e C and X e <€, and <$+ = {X 6 : X(X) = X} is a 

split real Cayley algebra. Define C + :g-+g by C+: D-+XDX. Then it is 
easy to verify that C+ is a conjugation of g and its real form is +), a 
split algebra of type G 2 . By following the similar arguments for Lie algebras, 
it can be shown that every real Cayley algebra is the fixed point set of some 
TX with T e AutO^), and the real algebras «’ 1 and corresponding to 
Tjl and T 2 X are isomorphic if and only if there is some T e Aut & with 
T 2 = TT 1 XT~ 1 X. However, it follows from Proposition 14.23 that Au t(g) 
is equal to the set of automorphisms <p(T): D -> T DT~ 1 with T e Aut(^). 
Finally one can check that the real form of (p(T 1 )V + is where is the 

fixed point set of TX and that and ®(# 2 ) are isomorphic if and only if 

and # 2 are isomorphic. Since we have mentioned that there are only two 
isomorphism classes of real Cayley algebras, the same must be true for their 
derivation algebras. 
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Proposition 15.10 Let g = 9(n, C ) be the simple complex Lie algebra of 
type D„ and assume n > 5; that is, g = {X egl(2n, C ): LX'L = -X} where 



The following is a listing of one representative from each isomorphism class 
of real forms of g. 

(a) 3(n, k, R) = {X eg: - T k X'T k = X} where k = 0, 1, 2, ..., n and 


T k = 


■ I 

0 

0 

0 

0 

0 

0 

I 

0 

0 

I 

0 

0 

I 

0 

0 


I }n-k 


}k 

}n—k 

}k 


The signature of 2(n, k, R) is 2k(2n - k) - n(2n - 1), and for k — 0 the 
form is compact. 

(b) 3{n, -1, R) = {X e g : - T.^TZ J = X} where 


T-t = I 

' 7 

0 ‘ 

0 

-/ _ 


The signature of 9(n, —1, R) is - 

-n. 



}n 

}«. 


Proof We must first describe Aut(g). It was seen in Chapter 14 that the 
inner automorphisms of g consists of all <p(T): X -> TXT -1 where LTL = 
T _1 and det(T) = 1. It is clear that <p(T ) is an automorphism even if det(T) = 
— 1, and it is claimed that such (p(T )'s are not inner automorphisms. Thus 
Aug(g) = {(p(T): LTL = T 1 and det T = +1}. To show that we cannot have 
<P(7 2 ) = <P(J 1 ) with det(T 2 ) = — 1 and det(T,) = +1, merely notice that this 
would imply ^(T 1 “ 1 T 2 ) = 7 so = ±7 and det(±7) = 1 # -1 = 

detCZT 1 ^). 

It is clear that 9(m, 0, R) — {X e g : —X* = X} is a compact real form so 
denote its conjugation by C_ : X —> —X*. Every conjugation of g is equivalent 
to one of the type <p(T)C _ where <p(T) 2 = cp(T 2 ) = 7 and C_ <p(T)C_ = 
<p((T'y v ) = (p(T). The first condition implies T 2 = +7 and the second 
implies TT* = 7. 
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We first consider the case T 2 = I. Then Thas eigenvalues ± 1 and we can 
assume that the +1 eigenspace has at least the dimension of the -1 eigen- 
space. We can consider the nondegenerate symmetric bilinear form on the 
2n-dimensional complex vector space V of column vectors defined by (x, y) = 
x'Ly so g = {X egl(V) : (Xx, y) + (x, Xy) = 0} and LT‘L = T~ l if and 
only if (Tx, Ty) = (x, y). Now with T as above, suppose Tx = x and 
Ty = -y. Then (x, y) = (Tx, y) = (x, T~ 1 y) = -(x, y) so the +1 eigenspace 
and -1 eigenspace are orthogonal with respect to (x, y). Next notice that for 
each T k of (a) we have T k 2 = I, T k T k ' = /and the +1 eigenspace is orthogonal 
to the -1 eigenspace for T k . It is also easy to verify that the -1 eigenspace 
of T k has dimension k, so it follows that there exists an 5 with LS‘L = S' 1 
and for some k, STS' 1 = T k . Since T = T" 1 = Tand Tis Hermitian, we can 
assume that S is unitary; that is, S' = S' 1 . However, then noting 

C_ ^(sr’c-W = C_ <t>(S)-\-X t ) 

= C_(-S- 1 J‘S) = S‘X(ST l = <P(S‘)(X), 

we obtain <^(S)(/>(T)C_ <j>(S)~ 1 C- = <j>(STS‘) = <j>(STS ') = 4>(T k ). There¬ 
fore by Proposition 15.3(c), ^>(T)C_ is equivalent to (j>(T k )C- . 

For the second case T 2 - -I, we repeat the now familiar agruments once 
more. Thus T has eigenvalues ±i and if Tx = ix, Ty = iy, then i(x, y) = 
(Tx, y) = (x, T~ l y) = -i(x, y), so (x, y) = 0 for all x, y in the +i eigen¬ 
space. This implies the +i and -i eigenspaces have the same dimension. 
For T_j as in statement (b) we see T 2 k = -/ and has eigenspaces like 
T, so once again we can show that <](>(T)C_ and <p(T_ 1 )C_ are equivalent. 

It remains to show that none of the algebras on our list are isomorphic. 
First we would have to verify that the correct signatures are listed and then 
check that this helps imply the desired conclusion. The calculation of the 
signatures will be omitted since they are straightforward but long and cum¬ 
bersome. The remainder of the calculations are given in the following exercise. 

Exercise (6) Assume the signatures as given in the previous pro¬ 
position. Then show that for a given n the only algebras which have the same 
signatures are 9(n, n - « 1/2 , R) and 9(n, - 1, R) if n happens to be a perfect 
square. Show that these two algebras are not isomorphic. 

Proposition 15.11 Let g = 88(n, C) with « > 2 as described in exercise 
(1), Section 13.2. Then the following list gives one representative from each 
isomorphism class of real forms: &(n, k, R) = {X eg : T k X’T k = X) where 
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for k = 0, 1, 2, n and T k is as in Proposition 15.10. The signature of 
@(n, k, R ) is (2k — n)(2n + 1) — 2k 2 , and for k = 0 the form is compact. 

Proposition 15.12 Let g — #(w, C) with n > 3. Then the following list 
gives one representative for each isomorphism class of real forms of g. 

(a) V(n, k, R) = {X eg: -S^St = X} for k = 0, 1,2,..., [n] 2] and 


' / 0 

0 

0 * 

}n — k 

0 -I 

0 

0 

)k 

0 0 

I 

0 

}n — k 

o 

O 

0 

-I m 

}k 


The signature of #(w, fc, R) is 8 k(n—k) — n(2n + 1), and for k = 0 the form 
is compact. 

(b) <t(n % -1, R) = {X eg: -T^FTZl =-Y} where is as in 
Proposition 15.10. The signature of %>(n , — 1, i?) is n. 

Exercises (7) Show that a proof of Proposition 15.11 is actually 
contained in the proof of Proposition 15.10. 

(8) Prove Proposition 15.12. 

Remark (4) We do not have enough information about the auto¬ 
morphism groups of the complex Lie algebras JD 4 , F 4 , E 6 , E 7 , and E 8 to 
classify their real forms in the same manner as the others. We list the number 
of real forms of each type and their signatures in Table 15.3 without proof. 
A very explicit proof of all of these facts can be found in the work of 
Freudenthal and De Vries [1969], 

TABLE 15.3 
Number of classes 


Type 

of real forms 

Signatures 


5 

4, 2, -4, -14, -28 

f a 

3 

4, -20, -52 

E 6 

5 

6, 2, -14, -26, -78 

E n 

4 

7, -5, -25, -133 

E 8 

3 

8, -24, -248 
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2. Representations of Real and Complex Simple 
Lie Algebras 


Since the simple complex Lie algebras and their real forms are all com¬ 
pletely reducible, the finite-dimensional representations of these algebras are 
all direct sums of irreducible ones. In this section we will describe the irre¬ 
ducible representations of the simple complex Lie algebras and their relation¬ 
ship to the representations of the real forms. 

Remark (1) For the remainder of this section g will denote a simple 
complex Lie algebra, h a Cartan subalgebra of g, M the set of nonzero roots 
of g, and 36 a basis for 31. We wish to determine the equivalence classes of 
finite-dimensional irreducible representations of g where two representations 
Ci: g~*gKV i). i=l,2, are called equivalent representations if there exists a 
nonsingular linear transformation T: V t -*V 2 with ( 2 (20 = T( i (X)T~ 1 for 
all X eg. As in the case of the adjoint representation, if C: g->ffl(V) is a 
representation of g, then V is a direct sum of weight spaces of ((/>). More 
explicitly let F(A) ={veV: (((//) - X(H)lfv = 0 for all He fi and some 
positive integer k}. Here A is any linear transformation A: h-+ C and A is 
called a weight of C if F(A) is nonzero. Let iT denote the set of all weights of 
( (including 0 perhaps). Then 

V= I F(A) 

as a vector space direct sum. 

Proposition 15.13 (a) If aei, X eg( a), le'W, and veV{X), then 
C(X)(v) e V(a + A) if a + A e and otherwise ((20 (d) = 0. 

(b) The weight spaces are in fact eigenspaces; that is, ((#)(d) = X(H)v 
for all H eh and v e F(A). 

(c) If 36 ={a t , a 2 , ..., a„}, then there exists a unique weight A 0 e iH 
such that A 0 + a k i iT for k = 1, 2, ..., n, and A 0 is called the maximal 
weight of (. 

(d) If A e "W, then A = A 0 - t k a k for some positive integers t k . 

(e) The dimension of F(A 0 ) is 1. 

Proof (a) and (b) Clearly there exists some A e if and v e F(A) 
such that C(//)(t>) = A( H)v for all H eh. Then for any X e g{ a) 

C(H)C(X)(v) = C(Z)((/f)(u) + R(H), CW](d) 

= (A(H) + a(H)X(X)(v) 
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so either £(X)(v)=0 or £(X)(v) is an eigenvector of £(H) and £(X)(v) e 
V(a + A). Now it is clear that v generates a ^-submodule of V which consists 
entirely of eigenvectors of £(h). Since V is irreducible this ^-submodule must 
be all of V. 

(c)-(e) Let A 0 £ W be a weight such that A 0 + a* 4 Hr for k = 1 ,2,..., n. 
Such a weight must exist because V is finite dimensional and so has only a 
finite number of weights, and given any weight we can continue to add a*’s 
to this weight until it satisfies this property. 

Let 0 X t eg(a k ) and 0 ^Y k sg(— <x k ) for k = 1, 2, n. Choose a 
fixed v 0 e V(X 0 ). Then since V is irreducible, it is clear that V is spanned by 
vectors of the type T(v 0 ) where Tis a product of certain ((XJs and £(Y k )’s. 
Also notice that A 0 + a k $ W implies that £(X k )(v 0 ) — 0 for k = 1 , 2, ..., n. 
We claim that V is spanned by a vector of the type T{v 0 ) where Tis a product 
of certain C(T t )’s. To see this set H k = [X k , y t ] so that H k e h and notice that 
then using Proposition 14.17 

axjKmv 0 ) = c(PoniK«to) + c(Y k mj)(v 0 ) = o 

and 


MW = cmivo) + ar k K(x k )(v 0 ) = x 0 (H k )v 0 . 

By making repeated use of the above procedure we see that we can eliminate 
any £CX*)’s from longer products. 

Now suppose that T(v o )^0, where T is a certain product of C(i*)’s 
where £(Y k ) occurs t k times. Then T(v 0 ) e F(A) where 

n 

A = ^ ^ a k • 

This proves (d). 

To see that the dimension of V(k 0 ) is 1, notice that the formula above 
guarantees that none of T(y 0 )’s, with T a nontrivial product of C(T fc )’s, are in 
K(A 0 ). Thus F(A 0 ) consists of all multiples of v 0 and the proof of (e) is 
completed. 

Finally we must show that A 0 is the unique maximal weight. Our proof has 
shown that all weights of £ can be found from a maximal weight by the formula 
in (d) and it is clear that only one weight can satisfy this property. 

Remarks (2) Proposition 15.13 implies in particular that each root 
system of a simple complex Lie algebra has a unique root of maximal height; 
that is, there is a unique root £j =1 t k a k e with £jj =1 t k maximal. 

(3) Although dim V(l 0 ) = 1 we may have dim V{X) > 1 for other 
A e In fact this is always true for the weight 0 of the adjoint represen¬ 
tation of a g with rank g > 1. 
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Proposition 15.14 For any a e 0t let H a ' e/ibe as usual so that we can 
choose Xeg( a), Y eg(-ct) with [H a 'X] — 2X, [H a 'Y] = — 27, and 
[XY] = 77 a \ Then is an integer for each Aeif. 

Proof Let g be the subalgebra of § generated by H a \ X and Y so that 
g ^ 5/(2, C). Choose any v e V(X). Then the action of g' on v generates an 
irreducible g subspace V' of V. Now C,(H a ')(v) — and must be 

an integer by Theorem 13.11. 

Example (1) Let g = 5/(2, C). Then for 0t = {-a, a} we have H a f = 
[o -?]• The weights of the adjoint representation are W x = {-a, 0, a} 
and <x(H a l ) = 2. Let V be the vector space of two-dimensional complex 
column vectors with g acting on V as usual by matrix multiplication. Then 
the weights of this representation are 2 = { —a/2> oc/2} and V(af 2) is spanned 
by the vector [£]. Finally (//«') = 1 an integer as guaranteed by the 
proposition. 

Theorem 15.15 Let g be a simple complex Lie algebra and X 0 : fi -► C 
a linear transformation such that each A 0 (//* k ) is a positive integer or 0 for 
oe* e 38. Then there exists an irreducible representation ( : g -+gl{V) with X 0 
as maximal weight. The representation is finite-dimensional, determined up to 
equivalence by A 0 and every finite-dimensional representation of g is equi¬ 
valent to some such 

Proof It is clear from the proof of Proposition 15.14 that A 0 (//* k ) 
is a nonnegative integer if X 0 is the maximal weight of a finite-dimensional 
representation of g. Therefore, all we need show is that £ is completely 
determined once we are given some 0/%e V(A 0 ), 9 and the nonnegative 
integers X 0 (H = a k . 

Now for k = 1, 2,. .«, let H k = H' ak and choose X k eg(oc k ), Y k eg{-<x k ) 
so that H k , X k , Y k forms the usual basis for a subalgebra g k of g with g k £ 
5/(2, C). By Theorem 13.11 the number a k completely determines the action of 
g k on an irreducible subspace of V with v 0 in the weight space of the maximal 
weight. More precisely C(Y k ) J (v 0 )J = 0, 1, 2for some p is a basis for 
one of the modules described in Theorem 13.11. 

It suffices to show that we can determine the action of all of g on 
basis elements of the type 

v = aY kx y'--ttY kg y*(vo) 

and we proceed by induction on Since C(X k )£(Y k ) = £(Y k )£(X k ) 

+ £(//*), CiXjWYk) * C(Y k )C(Xj) ifj ^ k and C(X k )(v 0 ) = 0, we can determine 
the action of £(X) on v for ail X e g(a) with ae^ + . For any H e fi 

i=i 
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Next to determine whether or not C(T t )(i>) is 0 we first set 

v i = C(X k ) r (v) A 0 where £(X k ) r+1 {v) = 0. 

Then the integer determines an irreducible representation of g k 

with v x as maximal weight vector. From this we determine whether or not 
C( Y k )(v) is 0. 

To show that the module generated is finite dimensional we need only 
show that for each k = 1, ..., n there are only finitely many irreducible g k 
submodules of our y-module. From the fact that each weight space of our 
module is finite dimensional and there are only a finite number of weights X 
of our representation with X(H k ) > 0, we conclude there are only finitely 
many maximal weight spaces for irreducible representations of g k . 

Finally we must determine a basis for each weight space. 

Exercise (1) Finish the proof of Theorem 15.15 by showing that a 
basis for each weight space can be determined from g and the integers 

• j fl n . 

Definition 15.16 The representations C*, k — 1,...., n of g with maximal 
weights X k such that X k (H k ) = 1 and X k (Hj) = 0 for j =£ k, are called the basic 
representations of g. 

Proposition 15.17 If g t and g 2 are finite-dimensional representations of 
g with fij: g gl(Vj), j~\, 2 and fij has maximal weight X } , then define 
Hi ® g -2 on V k ® V 2 by 

Oh ® H 2 )(X) :u®v^g,{X){u) ® v + u® g 2 (X)(v). 

Let 0 * Vj e V/Xj) for j = 1, 2, then //, ® g 2 : g -*gl{V k ® V 2 ) is a rep¬ 
resentation of g such that the submodule of V x ® V 2 generated by v x ® v 2 
is the module of an irreducible representation with maximal weight X x + X 2 . 

Proof Some trivial computations show that g x ® g 2 is a representation ; 
note Sections 9.4 and 12.5. Now 

Oh ® g 2 ){H)(Vx ® v 2 ) =(X 1 + X 2 )(H)(v 1 ® v 2 ) 

and clearly v x ® v 2 belongs to a maximal weight space of V x ® V 2 so the 
proposition follows. 

Corollary 15.18 Any finite-dimensional irreducible representation of g 
can be obtained from some tensor product of basic irreducible representations 
as in the previous proposition. 
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Proof Let £ be an irreducible representation of g with maximal weight X 

and set a k = X(H k ) iork = 1, 2. n. Each a k is a nonnegative integer so we 

can set V equal to a tensor product of F*’s with V k occurring a k times where 
H k :g -> gl(V k ) are the basic irreducible representations defined in Definition 

15.16. This tensor product yields a representation as defined in Proposition 

15.17, and the tensor product of the maximal weight vectors of the V k s 
generates a gf-submodule of V with has the same maximal weight as C* 

Exercises (2) Show that the adjoint representation of the complex 
Lie algebra of type G 2 is a basic representation. Also show that the rep¬ 
resentation of this algebra as derivations of a Cayley algebra is basic and not 
equivalent to the adjoint representation. 

(3) Is the adjoint representation of g = sl(n + 1, C) a basic representation ? 

Remark (4) Often a representation is described by adding to the Dynkin 
diagram of g the number X 0 (H k ) to the vertex of the diagram corresponding 
to the root <x k . Since the Dynkin diagram completely determines the structure 
of g and an irreducible representation ( is completely determined by the 
numbers X 0 (H k ) for its maximal weight X 0 , theoretically one could construct 
the Lie algebra and its representation using only the diagram. In practice, of 
course, the computations are prohibitive, but it is not too difficult to determine 
the maximal weight X 0 from the diagram. 


Example (2) Consider the accompanying diagram with 0’s and l’s 
attached to the roots which is the diagram for a basic irreducible represen- 

1 

° «6 

0 0 0 0 0 

o-o-o-o-o 

«2 «3 a 4 a 5 

tation of the complex Lie algebra of type £ 6 . We can set /. 0 = 
and we would like to determine the coefficients t k . We know 

X 0 (H 6 ) = 2<1 0 , a 6 y/(a 6 , a 6 > = 1 and X 0 (H k ) = 2<A 0 , a*>/<a*, a t > = 0 

for k = 1,..., 5. Also 

2<a*, at>/<a*, <**> = 2 » 2<a 1 , a 2 >/<«i, a i> = 

2<a 1 ,a 3 >/<a 1 ,a 1 > =0 
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and so forth. Thus 

Wi) = ~ 1 2 = 0 y 

^ 0 (^ 2 ) = + 2t 2 ~ h ~ 0 , 

^0(^3) = + 2/ 3 — t 4 — t 6 — 0 , 

^0(^4) — “G + 2 t 4 — f 5 = 0 , 

^ 0 (^ 5 ) “ ~~* 4 + 2^ == 0, 

^0(^6) = ^3 + 2 t 6 — 1. 

Solving the system of equations one finds that t t = t s = 1, = t 4 = / 6 = 2, 

and *3 = 3 so A 0 = a! + 2a 2 + 3a 3 4- 2a 4 + a 5 + 2a 6 . This is, in fact, the 
maximal weight of the adjoint representation of the complex Lie algebra of 
type E 6 . There are convenient tables listing the maximal weights of all of the 
basic representations for all of the simple complex Lie algebras as well as a 
listing of all of the weights of the adjoint representations in the appendix 
of the book by Freudenthal and de Vries [1969]. 

Remark (5) Given an irreducible representation C:g-+gl(V) with 
maximal weight 2 0 and with the positive roots of g y then the dimension of 
V can be obtained from Weyl’s formula 

dim V = FI < A o + P, «>/</?, a> 

ace# 

where ft * j ^ ie< h a. The proof of Weyl’s formula uses properties of rep¬ 
resentations we have not discussed so it will not be given here. A proof can 
be found in the book by Jacobson {1962]. 

Exercise (4) Let g be a complex Lie algebra of type A 3 and £ : g gl(V) 
a basic irreducible representation corresponding to the accompanying 
diagram. Assuming Weyl’s formula and assuming that we know that 

0 1 0 

o-o-o , 

«1 <*2 a 3 

^ = { a i> a 2 > a 3 > a i + a 2 > a 2 + a 3 > a i + a 2 + * 3 }. find the maximal weight 
A 0 of C as a linear combination of the a*’s and find the dimension of V. 

Examples (3) The complex Lie algebras of type A„, B„, C„, and D„ 
were all defined as certain sets of matrices so this definition includes the 
description of a representation on a set of column vectors. Thus sl(n + 1, C), 
&(n, C), W(n, C), and 3i(n, C) are descriptions of representations on spaces 
of dimension n + 1, 2n + 1, In, and In, respectively. In each case these are 
representations on spaces of minimal dimension. 
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(4) Consider two irreducible representations £ k'-9~*dK^dt k — 2, 

such that the diagram for ( 2 is obtained by permuting the numbers assigned 
to the vertices of the Dynkin diagram by reflecting them through a symmetry 
of Dynkin diagram. From the symmetries involved in constructing g and the 
representations from the diagrams it is clear that dim V 1 = dim V 2 and there 
will be a natural correspondence between weight spaces. Thus we can assume 
that V t = V 2 = V and the representations £ t and £ 2 W ‘H resemble each other 
even if they are not equivalent. For example, if g = sl(n + 1, C) for n ^ 2 
(so g is of type A„) and we consider the representations corresponding to the 
accompanying diagrams. It can be shown that in this case dim(F) = n + 1 

1 0 0 0 0 0 0 1 
o-------- 1 o-o-----o . 

Oq 0C 2 a n-l a n “l “2 ®»-l 

Cl h 

and we can let F be the vector space of complex n + 1-dimensional column 
vectors. For A e gl(n + 1, C) and x e V we can set £ x (/l)(x) = Ax the usual 
matrix multiplication and £ 2 C'4X*) == ~A'x. The statement that £i and £ 2 arc 
not equivalent is the same as saying that A —> — A‘ is not an inner automor¬ 
phism of g. 

Remark (6) We will now consider representations of real forms of a 
simple complex Lie algebra. Given a real form g of g and £: g -* gl(V) a 
representation of g, then £ restricted to g is clearly a representation of g but is 
considered to be unsatisfactory in certain ways because this gives an action 
of the real algebra g on a complex vector^ space V. This can be remedied by 
restricting the scalar multiplication on V to the real numbers, but a more 
useful solution is described in the next definition. Notice that we have altered 
our notation slightly, for the remainder of this section g, V, £ will denote the 
complex Lie algebras, vector spaces and representations and g, V, £ will 
denote real ones. 

Definition 15.19 (a) Given a finite-dimensional complex vector space 
pand C : F-> Fa nonsingular linear transformation of F when we restrict 
scalar multiplication in F to the real numbers. If C 2 = I and^C(ax) = aC(x) 
for all a e C and xeV, then C is called a conjugation of F and 

V = {xe V: C(x) = x} 

is said to be the real form of F corresponding to C. _ 

(b) Given a real form g of g, a simple complex Lie algebra, and £:<?-> 
gl(V) an irreducible representation of g, then £ is said to be real for g if 
there exists a conjugation C of F with corresponding real form F such that 
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£(X)(x) e V for all X eg and x e V. In this case the restriction £ of £ to g 
acting on V is said to be a real form of £. 

Proposition 15.20 (a) If V is a real form of V, then V is a real vector 
space with the dimension of V over R equal to the dimension of Lover C. 

(b) If l : g ->gl( V) is an irreducible representation of g, then any real 
form of £, say £ : g^gl{V), is an irreducible representation of g. 

(c) If £i : g -* gl( and £ 2 •9~*gK^i) are two equivalent irreducible 
representations of g, then £ t is real for a real form g of g if and only if £ 2 
is real for g. 

Proof (a) and (b) are trivial to prove. 

(c) Choose a T such that £ 2 (AT) = T£j(A:)T _ 1 for all X eg 

and let £ x :g-+gKV x ) be a real form of £ t . Then V 2 ={T(x): x e L,} is a 
real form of V 2 and for all X eg, x e V l we have l 2 (X)T(x) = 
T£ 1 (Z)T _1 (T(x)) = r£,(A')(x) e V 2 as required. 

Examples (5) Obviously the adjoint representation of any complex 
Lie algebra g is real for all real forms of g. 

(6) The representations of the complex Lie algebras of type A„, B n , 
C n , and D„ discussed in example (3) above are real for their split forms 
because we know that sl(n + 1, R), 9S(n, R), <6(n, R), and 2>(n, R) are rep¬ 
resentations of the split real forms in terms of real matrices of the required 
type. 

Exercises (5) Let g be the complex Lie algebra of type A„,g a compact 
real form of g, and £ : g -* gl(n + 1, C) the usual representation of g as the set 
of (n + 1) x (ft + 1) complex matrices of trace 0. Show that £ is not real for^. 

(6) Show that both basic irreducible representations of a complex Lie 
algebra of type G 2 are real for both real forms. 


3. Some Simple Real and Complex Lie Groups 

In this section we make a few brief comments on how the results we have 
obtained for representations can be applied to Lie groups. We will concern 
ourselves only with simple Lie groups; that is, Lie groups whose Lie algebras 
are among the finite-dimensional simple Lie algebras we have discussed. 
Also see Definition 10.14. 
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Definition 15.21 A Lie group <7 is said to be a linear Lie group if there 
exists some finite-dimensional vector space V and some G x c= GL(V) with 
G and G x isomorphic as Lie groups. 

Examples (1) If £ :g^gl{V) is a representation of a simple complex 
Lie algebra g and exp : gl{ V) -> GL( V) is the usual exponential map, then the 
subgroup of GL(V ) generated by exp(£(#)) is clearly a complex linear Lie 
group. If C : g^>gl(V) is a real form of (, then the subgroup of GL(V) gener¬ 
ated by exp(((#)) is a real linear Lie group. 

(2) For n > 2 let g be a complex Lie algebra of type A„ and let £ u 
£2 : § -?gKn + 1, C) be the two representations of example (4), Section 15.2. 
Thus l x and £ 2 are not equivalent. The groups generated by exp(( 1 (^)) and 
QX P{£i(g)) are equal to SL(n 4- 1, C). Therefore the complex linear Lie groups 
corresponding to the two inequivalent representations can be viewed as the 
same set of matrices. 

Remark (1) Recall that the centers of connected simple Lie groups are 
discrete and are finite if, in addition, the groups are compact. If G is a con¬ 
nected simple Lie group, G x is the simply connected covering group of G and 
Z is the center of G u then there exists a subgroup H of Z such that G is Lie 
isomorphic to GJH. If we are given a simply connected simple Lie group 
G { and £ a representation of the Lie algebra g of G u then <7, the group 
generated by exp(CO)), is Lie isomorphic to some GJH. One reason that 
representations of Lie algebras are so important is that the subgroups H 
may not be isomorphic for inequivalent representations £. It is also known 
that for certain subgroups H of Z, the group GJH is not Lie isomorphic to 
the group generated by exp(C(^)) for any irreducible representation £ of g. 
For certain subgroups H the groups GJH may even fail to be linear groups. 
It is a difficult problem determining the relationship between the real forms of 
irreducible representations of a simple Lie algebra and the corresponding 
subgroups of the center of the appropriate simply connected group. We will be 
satisfied stating a few results for the case that the Lie algebra is simple complex 
or is a compact real form. A very interesting set of tables containing the results 
for all real forms can be found in the book by Tits [1967]. 

Proposition 15.22 If g is a compact real form of a simple complex Lie 
algebra g and £ : g g!(V) is the real form of some irreducible representation 
of g , then (7, the group generated by exp(CO)), is a real compact Lie group; 
that is, G is compact as a manifold. 
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Proof Since any covering group of a simple compact Lie group is 
compact (Section 12.6), we need only consider the case when £ is the adjoint 
representation of g. Thus let G be generated by exp(ad(#)) and recall that 

Kill(T(X), T(Y)) = Kill(X, Y) 

for all X,Y eg and T eG. We can define a positive definite bilinear form 
on g by B(X, Y) = -Kill(X, Y), and we have G e O(g) = {T e GL(g): 
B(T(X), T( Y)) = B(X, Y)}. Thus G must be compact since O(g) is compact. 

Remarks (2) A much stronger result is known and is not too difficult 
to prove, namely a simple connected Lie group is compact if and only if its 
Lie algebra is a compact real form of some complex Lie algebra. 

(3) We conclude this section by stating a theorem which lists the centers 
of simply connected, compact, simple Lie groups. Several texts, including 
those by Wolf [1967], Loos [1969], Fieudenthal and de Vries [1969], and 
Tits [1967] give a proof of the theorem or parts of it. The last book mentioned 
also lists similar information for noncompact groups and lists the centers of 
the groups generated by exp(£(K)) for various representations of simple 
Lie algebras. 

Theorem 15.23 Let G t be a connected, simply connected, simple, 
complex Lie group. Let G be the maximal compact subgroup of G,, and let 
9x and g be the Lie algebras of G, and G, respectively. Then g is a compact 
real form of g t and G is simply connected. If Z(G,) and Z(G) denote the 
centers of G x and G, then Z(G,) = Z(G) and these centers are listed in the 
Table 15.4. 

TABLE 15.4 


Type of Z(G 1 ) = Z(G) 


A n , n> 1 

Z„ + i 

B„, n> 2 

Z 2 

Cn, «>3 

Z 2 

« even, n>4 

z 2 ®z 2 

D ny n odd, n > 5 

z 4 

e 6 

Z 3 

Ei 

Z 2 

E% y F*, G 2 

{1} 
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In such texts as those by Helgason [1962], Kobayashi and Nomizu [1963], 
Loos [1969], and Wolf [1967], a differential geometric approach to Lie groups 
is used. In this chapter we briefly introduce some of the basic concepts of 
differential geometry and relate them to some of the algebra we have developed. 
We give few proofs but sufficient references. 

Thus, after generalizing differentiation in Euclidean space to “ differentia¬ 
tion ” on a manifold M by using a connection V, we discuss the basic concepts 
of geodesics, parallel translation, pseudo-Riemannian structures, and 
holonomy groups. Then we apply these results to the G-invariant connections 
on M = G/H which is a reductive homogeneous space. The G-invariant 
connections are in one-to-one correspondence with certain nonassociative 
algebras which are rather general, as shown in Fig. A.l, where the dashed 
arrow denotes a local correspondence. Next we show how these algebras 
correspond to certain multiplications /r : G/H x G/H -► G/H analogous to 
the Lie group results in Sections 1.6 and 5.3. These results generalize facts 
on Lie groups and Lie algebras and we indicate how simple nonassociative 
algebras are related to irreducible G-invariant connections on G/H and how 
to construct general results from these components. Finally we give a way 
of computing those nonassociative algebras which induce pseudo-Riemannian 
G-invariant connections on G/H in terms of a Jordan algebra of endomor- 
phisms. 

Connections 


Multiplications -— — — ■ 

Fig. A. 1 
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1. Connections on Manifolds 


We now consider the concept which generalizes differentiation in R" to 
differentiation on a C°°-manifold M. Then the geometric concepts of geodesics, 
curvature, torsion, holonomy, and Riemannian connections are discussed. 

Definition A.l Let M be a C^-manifold and let D(M) be the vector space 
of C 00 -vector fields on Af. An affine connection on JVf is an R-bilinear map 
V : D(M) x D{M) -► D(M ): (Z, Y) -► V* Y satisfying 


^ fx+g y(Z) — / V* Z + #V y Z, 
Vx(fY)=fVxY+(Xf)Y 9 

where/, g e C°°(M). The operator V* is called covariant differentiation relative 
tol. 

Note that for M = R", a vector field Y can be regarded as a function 
Y : R" -»R n ; thus the above definition reduces to that given in Chapter 1. 
Also it is shown by Kobayashi and Nomizu [1963] that connections exist on 
a manifold. 

In terms of a coordinate neighborhood U of p e M with coordinate 
function x = (jq,..., x n ) 9 the connection V is determined by n 3 real-valued 
C°°-functions T*, on U by 

v a/tal (3/a*,.) = X r*, 8idx k 

k 

where d/dx r are the usual coordinate vector fields on U. 

These functions T*, completely determine V on U, for given vector fields 
X — £ a* and Y = £ ft, d/dxj on £/, we can use the properties of V to 
show 

V*Y = ZZ latfbjdxt + Z T?,ft,)] . (*) 

k i j 

Conversely, given any real-valued C°°-functions T*, on £/, we can define 
V x Y by (*). This can then be extended to all of M provided certain compati¬ 
bility conditions are satisfied [Helgason, 1962, p. 27]. 

Definition A.2 Let M be a C°°-manifold with affine connection V, let 
a : I M be a C°°-curve in M with tangent vector field X; that is, X(t) — &(t) 
for all t in the open interval /, and let J be a closed subinterval of I. A C 00 - 
vector field Y on a is parallel along a (restricted to J) if (V* Y)(a(t)) = 0 for 
all t e J. The curve a is a geodesic if (V x X)(a(t)) = 0 for all t e J. 
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The proofs of the following results on parallel fields and geodesics can be 
found in the work of Helgason [1962], Hicks [1965] and Kobayashi and 
Nomizu [1963] and simply involve the solution to differential equations. 

Theorem A.3 Let M be a C°°-manifold with affine connection V, let 
<7 :1 -► M be a C 00 -curve in M, and let [a, b ] <= I . For each Y e T(M, v(a)) 
there is a unique C°°-vector field Y(t) on a \ [a , b ] such that Y = Y(a) and 
Y(t) is parallel along a (restricted to [a, b]). Furthermore the map 

o(a, b ): T(M, (7(a)) -> T(M, (7(b)) : Y(a) -+ Y(b) 

is a vector space isomorphism called parallel translation along a from cr(a) to 
a(b). 

Outline of proof Without loss of generality we can assume a lies in 
a coordinate neighborhood U and has no double points. For x t ,..., x n 
coordinate functions on U, we can write a(t) = £ a ( (t) d/dxfa (t)) and 
X(t) = &(t) =£ a/(t) d/d Xi (a(tj) and Y(t) = £ a t (t) 8/dx t (a(t)). Then Y(t) 
parallel along cr implies 

0 = V x r = X (Z a i dajdx t + Z Vi'ajTty d/dx k 

^ * i, j 

on a. This gives for t e [a, b] the equation for the a k , 

ddjdt + Z ajCiT^j = 0 
f, j 

fork= 1,..., n where we use the chain rule on a k (t) = a^cr^t),a n (t)). 
The unique solution to this system of linear differential equations gives the 
results concerning the isomorphism a(a , b). If a is to be a geodesic, we obtain 
for X(t) = Y(t) the system 

d 2 (7 k /dt 2 -b £ daJdt dojjdt = 0 

i>j 

whose unique solution with specified initial conditions gives the following 
result. 

Theorem A.4 Let M be a C®-manifold with affine connection V, let 
p e Af, and let X e T(M, p ). Then for any real number a there exists a real 
number e > 0 and a unique geodesic a : [a — e, a + a] -> Af such that a (a) = p 
and a(a) = X. 

Note that if we let a = e(t, p, X, a) be the curve given by the above 
theorem, then from the theory of differential equations a is a C°°-function of 
the parameters f, />, X, and a . Also note that for M = R n and Ty = 0 on M 
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we obtain from the above differential equations that the geodesics are straight 
lines. We now consider some functions related to affine connections. 

Definition A.5 Let M be a C°°-manifold with affine connection V and 
let X , L, Z be C 00 -vector fields on M. The torsion tensor, Tor is given by 
Tor(A",Y) = V x Y—V y X—[X, Y] and the curvature tensor, R is given by 

« Y)Z = (V* V y - Vy V* - V [X , y] )Z. 

We also use the notation Tor v and R v . 

Note that Tor(.T, y) and R(X , Y)Z are again vector fields which are multi¬ 
linear in X , y, Z and satisfy several algebraic identities. The following results 
can be found in the work of Hicks [1965] and Kobayashi and Nomizu [1963]. 

Theorem A.6 Let M be a C°°-manifold. 

(a) If V is an affine connection, then there exists a unique connection V 
with the same geodesics as V and Tor v = 0 on M. 

(b) Two connections V and V on M are equal if and only if they have 
the same geodesics and Tor v = Tory on M. 

Definition A.7 Let M be a C°°-manifold, let p e Af, and let S (p) denote 
the set of nondegenerate symmetric bilinear forms on T(JVf, p ). A pseudo- 
Riemannian structure on M is a map 

< , > : M U S(p) 

pe M 

such that for all /?, q e M, the bilinear forms < , >(p) and < , }(q) have the 
same index and such that < , ) is C°° as follows. For each pair of C°°-vector 
fields X, y on JVf, the function < X, Y} : M -> R given by 

<x, YXp) = <X(P), Y(p)Xp) 

is C 00 on M. We often use the notation < , } p for the bilinear form < , }(p) on 
T(M, p). If each bilinear form < , >(p) is positive definite, then < , > is a Rie- 
mannian structure on M. A pseudo-Riemannian manifold is a C°°-manifold 
with a pseudo-Riemannian structure. 

Remarks (1) If the manifold M is connected, then we see that the 
index is automatically the same on each tangent space. This uses the de¬ 
nature of the function < , > and consequently if the index did change at a 
point, then the form becomes degenerate at that point. 

(2) If M is a paracompact C°°-manifold, then there is a Riemannian 
structure on M; see the work of Singer and Thorpe [1967]. 
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Example (1) Let G be a connected Lie group and H a closed (Lie) 
subgroup. Then on the homogeneous space G/H the diffeomorphisms 
z(a) : G/H -+ G/H :pH -*apH for aeG induce the tangent maps 
T(y(aj)(p) : T(G/H , p) -+ T(G/H , ap). In particular, for us H we use 
the notation f (u) = T(x(u))(e) : T(G/H, e ) T(G/H, e) and note x(tf) = 
{t(w) : ue H} is a subgroup of GL(T(G/H , e)). Let C be a nondegenerate 
symmetric bilinear form on T(G/H, e) such that for all X, Ye T(G/H , e) we 
have C(x(w)JY, x(w)Y) = C(.Y, Y) and for p — r(a)ee G/H define a nonde¬ 
generate symmetric bilinear form < , >(p) on T(G/H , p) by 

<u 9 V}(p) = C(Tz(a~ 1 )(p)U, Tr(a~ 1 )(p)V) 

for U , VeT(G/H, p). Using C(x(u)X, f(w)Y) = C(Z, Y) for JY, Ye T(G/H, e) 
we see that < , >(p) is independent of the choice of a e G so that z(a)e = p. A 
homogeneous space with a pseudo-Riemannian metric < , > given as above 
is called a pseudo-Riemannian homogeneous space. Note that when H is com¬ 
pact there exists a positive definite form C. Thus in this case G/H is a Rieman- 
nian homogeneous space. For another example, let g and h be semisimple and 
write g = m + h with m = h 1 relative to the Killing form Kill of g. Then 
identifying m with T(G/H, e\ note that C = Killjm x m is a nondegenerate 
symmetric bilinear form such that C(x(u)X, f (u)Y) = C(X , Y) for all X , Yem. 
Thus we obtain a pseudo-Riemannian metric < , ) on G/H. 

The proof of the following can be found in the work of Helgason [1962], 
Hicks [1965], and Kobayashi and Nomizu [1963]. 

Theorem A.8 Let M be a C°°-pseudo-Riemannian manifold. Then there 
exists a unique affine connection V satisfying the following conditions. 

(a) The torsion Tor is zero. 

(b) Parallel translation preserves the bilinear form on the tangent 
spaces; that is, if X , Y are parallel vector fields along a curve a, then the func¬ 
tion Y) is a constant on cr. 

The connection V given above is called the pseudo-Riemannian connection 
relative to the pseudo-Riemannian structure on M. It should be noted that 
conditions (a) and (b) can be expressed as: 

(a') V x Y-V y X~[X,Y] = 0; 

(b') Z<JY, Y> = <V Z X, Y> + <X, V z Y> 

for C°°-vector fields X, Y, Z on M. 

We shall now show how parallel translation induces a group acting on a 
tangent space T(M,p); a detailed expository discussion of this can be found 
in the text by Nomizu [1961]. Thus let V be an affine connection on M and 
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for p e M let H(p) be the set of vector space isomorphisms of T(M, p ) ob¬ 
tained by parallel translation around all broken C°°-curves a which start and 
end at p. We shall let the same letter a also denote the parallel translation 
along the closed curve a. For two such parallel translations a, t along closed 
curves x at p, the parallel translation along the composite curve ax is just 
the endomorphism product of the parallel translations a and x. With this we 
see that H(p) becomes a group of endomorphisms of T(M, p) called the 
holonomy group of V at p. The restricted holonomy group H°(p) is the sub¬ 
group of the holonomy group H(p) obtained by restricting the parallel trans¬ 
lations to closed curves which are homotopic to 0. 

Remarks (3) If M is connected, then for each p,q e M we have H(p) 
is isomorphic to H(q) as groups. In this case we define up to isomorphism the 
holonomy group of M, denoted by Hol(V), by Hol(V) = H(p) for somep e M. 

(4) The holonomy group H(p) is a Lie group. 

(5) Let M be a pseudo-Riemannian manifold with pseudo-Riemannian 
structure < , > and with the corresponding pseudo-Riemannian connection 
V. Then we define the holonomy group of the pseudo-Riemannian connection at 
p to be those endomorphisms of H{p) which are < , >(p)-orthogonal endo¬ 
morphisms of T(M, p). Thus in this case the elements of the holonomy group 
satisfy the additional condition of preserving the pseudo-Riemannian 
structure. 

A discussion of the above remarks and the following results is given by 
Nomizu [1961] and Kobayashi and Nomizu [1963]. Let M be a connected 
Riemannian manifold with Riemannian structure < , > and let p,qe M. The 
distance d(p , q) between the two points p and q is defined to be the infimum of 
the lengths of all broken C 1 -curves joining p and q . [The length of a C^curve 
a(t) for a < t < b is < a(t ), a(t) > dt where a(t) denote the tangent vectors 
of the curve.] The function d satisfies the axioms of a metric and gives the 
same topology on M as the original manifold topology. 

Definition A.9 A Riemannian manifold M is complete if the above metric 
dis complete; that is, if every Cauchy sequence relative to d has a limit point. 

Remarks (6) Every compact Riemannian manifold is complete and 
also every Riemannian homogeneous space G/H is complete. 

(7) If M is a connected complete Riemannian manifold, then any two 
points p,qe M can be joined by a geodesic whose length equals d(p, q). 

Completeness is used in the global version of the following decomposition 
theorem due to deRham: 
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Definition A.10 Let the manifold M have affine connection V and let 
H(p) be the holonomy group at p e M. Then T(M , p) is holonomy irreducible 
if the group H(p) acts irreducibly on T(M,p). If M is connected, then M is 
holonomy irreducible if H(p) is holonomy irreducible relative to some reference 
point peM. 

Theorem A.ll Let M be a connected Riemannian manifold with Rieman- 
nian connection V. 

(a) There is an orthogonal direct sum decomposition T(M, p) — 

T 0 + T x + + T s , where T 0 ={Xe T(M,p ): AX = X all A e H(p)} and 

for 1 < i < s each T t is //(p)-invariant and irreducible. 

(b) If M is simply connected, then the decomposition in (a) is unique up 
to order and H(p) is the direct product H 0 x H x x • • • x H s of normal sub¬ 
groups, where H 0 is the identity on T(M,p) and each Hi for 1 < i < s acts 
trivially on 7} for j ^ i and irreducibly on T f . 

(c) If M is a simply connected and complete Riemannian manifold, then 
M is isometric to the direct product M 0 x M 1 x ••• x M, where M 0 is a 
Euclidean space (possibly of dimension 0) and the M, for 1 < i <s are simply 
connected complete holonomy irreducible Riemannian manifolds. This de¬ 
composition is unique up to order. 


2• Connections on Homogeneous Spaces and 
Nonassociative Algebras 

Let G be a connected Lie group with Lie algebra g and H a closed (Lie) 
subgroup with Lie subalgebra h. Then the pair (G, H) or (g , h) is called a 
reductive pair if there exists a subspace m of g so that g = m 4- h (subspace 
direct sum) and (Ad H)(m) <= m. This last condition gives the condition 
[hm] e m in terms of Lie algebras. The corresponding manifold M = G/H is 
called a reductive homogeneous space. The main example we shall be using is 
when H is semisimple and connected. In particular, when G is also semisimple 
we can decompose g = m + h where m — h L relative to the Killing form of g. 

In order to consider G-invariant connections on G/H we use the following 
definition [Helgason, 1962; Kobayashi and Nomizu, 1963]. 

Definition A.12 Let <t> : M -> M be a diffeomorphism of the manifold M 
with connection V. Then (j) is an affine map or a connection preserving map if 
4>'(V x y) = for all X , Ye D(M), where <j>' is given by (j>'X = 

[(T<j))X] o 0" 1 ; see Section 2.7. 
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Definition A. 13 Let G/H be a reductive homogeneous space and for 
aeG let z (a) : G/H ^ G/H : xH ^ axH. A connection V on G/H is a G - 
invariant connection if for all a e G the functions z{a) are affine maps. 

Let (A, a) denote a nonassociative algebra A where a is the bilinear 
multiplication on the underlying vector space A. Then continuing the same 
notation, the G-invariant connections on G/H are given by the following 
result and we use the notation of Nomizu [1954]; also note the work of 
Kobayashi and Nomizu [1968]. 

Theorem A. 14 Let G/H be a reductive homogeneous space with a fixed 
Lie algebra decomposition g = m + h such that (Ad H)(m) a m and let Ad H 
denote the induced maps on m . Then there exists a one-to-one correspondence 
between the set of all G-invariant affine connections on G/H and the set of all 
nonassociative algebras (m, a) with Ad H c Aut(w, a) which is the group of 
automorphisms of the algebra (m, a). 

Remarks (1) To obtain a(X,Y) for X, Y e m we evaluate V x * 7* 
at e = eH for certain vector fields A*, Y* determined by X, Y on a neigh¬ 
borhood N* of e. 

(2) Let H consist of the identity so that G = G/H is an ^-dimensional 
Lie group with Lie algebra g. Then any nonassociative algebra structure on 
the ^-dimensional vector space g yields a G-invariant connection on G and 
conversely. Thus as with Lie groups and Lie algebras, we can use the results 
on nonassociative algebras to study G-invariant connections. 

(3) As discussed in example (4), Section 7.3, the subspace m in the de¬ 
composition g = m + h can be made into an anticommutative algebra as 
follows. For X, Ye m let [XY] = XY + h(X, 7), where XY= [XY] m (re¬ 
spectively h(X, 7) = [ XY] h ) is the projection of [XY] in g into m (respec¬ 
tively h). This algebra is related to connections by the following result [Nomizu, 
1954; Kobayashi and Nomizu, 1968]. 

Theorem A. 15 On a reductive homogeneous space G/H with fixed de¬ 
composition g = m + h, there exists one and only one G-invariant connection 
which has zero torsion tensor and such that the curves y(t) = exp tX for all 
X e m project by n : G -► G/H into geodesics y*(f) = ny{t) in G/H. In this case 
we have a(A, 7) = \XY. This connection is called the canonical connection of 
the first kind and we shall denote the corresponding algebra by (m 7 %XY). 

These algebras are not too difficult to compute. Thus continuing exercise 
(7), Section 12.2 and example (1), Section 6.4 we have for g = so(ri), h — so(p) 
the decomposition g = m + h with m = h L consisting of matrices of the form 






2 . 


HOMOGENEOUS SPACES AND NONASSOCIATIVE ALGEBRAS 


345 


and h consisting of matrices of the form 



Consequently XY = [ X , Y] m is of the form 

[[* 11 ,^] + [Z 12 , Y 21 ] X n Y l2 -Y n X l2 \ 
l*2i Y tl - Y 21 X tl 0 J* 

Remark (4) For a G-invariant connection determined by the algebra 
(w, a) the torsion and curvature tensors evaluated at e — eH are given by 

Tor(X, Y) = a(X, Y) - a (Y, X) - XY , 

R(X, Y)Z = ol(X, a( Y, Z)) - a( Y, ql(X, Z)) - a(2f Y, Z) - [h(X, Y) Z] 

where X , Y, Z e w and XY, /*(X, Y) are given above. Letting R(X , 7) be the 
endomorphism given by R(X, Y) : m-► w : Z-► R(X, Y)Z as above and 
letting L(2Q : m -> m : 7 a(X Y) we have the following result [Nomizu, 
1961; Kobayashi and Nomizu, 1968]. 

Theorem A.16 The Lie algebra of the holonomy group Hol(V) of a G- 
invariant connection V on G/H determined by the algebra (m, a) is the 
smallest Lie algebra h* of endomorphisms of m such that: 

(a) R(X ; Y) e h* for all X, Yem; 

(b) [L(X% h*] c: h* for all Xem, 

CoroUary A.17 If V is a holonomy irreducible connection of the first 
kind given by the algebra (m, jXY), then this algebra is simple or a zero 
algebra. 

Proof Briefly, suppose XY ^ 0 and let k = L(m) + Der (m) where L{m) 
is the Lie transformation algebra of m; see Section 7.2. Then k is a Lie algebra 
of endomorphisms of m such that [L(X), k]a k for all Xem and such that 

R(X, Y) = [L{X\ L(Y)] - L(XY) - D(h(X , Y)) 

is in k for all X , Yew, This uses the formula for R(X , Y)Z and part (vi) of 
Exercise (3), Section 7.3, which shows D(h{X\ Y)) = ad Y)| m e Der(w). 
Thus from the above theorem, A* c k. Now suppose m is not simple. Then 
from the work of Sagle and Winter [1967] there exists a proper ideal n of m 
which is Der(w)-invariant. Thus h*n a kn n which contradicts the holonomy 
irreducibility. 
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We now consider pseudo-Riemannian reductive homogeneous spaces G/H 
with the pseudo-Riemannian structure given in example (1), Section 16.1 and 
have the following result [Nomizu, 1954; Kobayashi and Nomizu, 1968], 

Theorem A.18 Let G/H be a reductive homogeneous space with fixed 
Lie algebra decomposition g = m + h. Let C be a symmetric nondegenerate 
bilinear form on m which gives the pseudo-Riemannian metric < , ) on G/H 
and let V be the corresponding pseudo-Riemannian connection. Then the 
algebra (m, a) induced by V is given by 

<x(X, Y) = \XY + U(X, Y) 

where U(X , 7) = U(Y, X) is uniquely determined. Furthermore a satisfies 
C(a(Z, X), Y) 4- C(X, a(Z, Y)) = 0 and C((Ad u)X, (Ad u)Y) = C(X 9 Y) 
for all X, Y,Ze m and ue H. Denote such an algebra by ( m , a, C). 

Remarks (5) From the work of Nomizu [1954] and Kobayashi and 
Nomizu [1968], U(X, Y) is given by 

2 C(U(X, 7), Z) = C(ZX, Y) + C(X, ZY) 

for X , y, Z e m. 

(6) For a pseudo-Riemannian connection of the first kind the algebra 
( m , a) satisfies <x(X , Y) = \XY , and for X Y, Z e m, C/e h, 

C(XZ , y) = C(JT, zy) and C((ad C/)X, 7) = -C(J7, (ad C/)7). 

Note this gives the possibility of applying Theorem 12.10 concerning asso¬ 
ciative forms. Examples of such algebras occur when g and h are semi¬ 
simple, m = h 1 relative to the Killing form of g , and C = Kill | m x m. 

(7) Let (m, \XY) be an algebra with positive definite form C satisfying 
the above conditions. Let s be an ideal of m so that ss = {0} and s is maximal 
relative to this property. Then write m = s + m! an orthogonal direct sum 
and note that since C is an associative form, m' is an ideal of m such that 
sm' — {0}. Now C\m' x m' is nondegenerate and m' contains no ideals n with 
nn = {0}. Thus we can apply Theorem 12.10 to conclude m f = m 1 *f * * • + m k 
where each m i is a simple ideal. However, this corresponds to the decomposi¬ 
tion of Theorem A. 11 and we have the following result; note the work of 
Kobayashi and Nomizu [1968]. 

Corollary A.19 Let M = G/H be a simply connected Riemannian reduc¬ 
tive homogeneous space with the connection of the first kind induced by the 
algebra (m, \XY ) and let the Lie algebra of the holonomy group be nonzero. 
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(a) Then M is isometric to the direct product M 0 x M t x * * * x M k of 
Riemannian reductive homogeneous spaces such that the holonomy group 
acts trivially on M 0 and irreducibly on Af, for 1 < i < k. 

(b) Let m — m 0 + m 1 + ■ • ■ + m k be the corresponding decomposition 
of m £ T(M, e) and let s = m 0 + nt h + * * • + m it for suitable corresponding 
M 0 , M it ,..., M it . Then the algebra m = s + m* where sm — {0} and m' — 
m j ! + •*• + m Jp where the ideals ,..., m jp are simple algebras corres¬ 
ponding to the remaining holonomy irreducible spaces M j] ,..., M jp . 


3. Multiplicative Systems and Connections 


We now show how a multiplication on a reductive homogeneous space 
G/H yields a (/-invariant connection by obtaining an algebra (w, a) from the 
multiplication. This is analogous to the way the Lie algebra arises from the 
multiplication in a Lie group. Let G/H be a reductive homogeneous space and 
let 


ji: G/H x G/H -» G/H 

be an analytic function such that /u(e, c) = e where e = eH . Then the structure 
(G/H, fi) is called a multiplication. If r(if) = {r(w) : ue H} is contained in 
Aut(G/f/, the automorphism group of the multiplication, then (G/H, fi) 
is called a multiplicative system. Analogous to the computations in Sections 
1.6 and 5.3, we show how a multiplicative system yields an algebra (m, a) 
with Ad H c= Aut(m, a) and consequently yields a (7-invariant connection. 

Thus let n : G -> G/H be the natural projection and let g = m + h be a 
fixed decomposition. Then from Section 6.4 we know that for the map 
ij/ = exp | m there exists an open neighborhood D of 0 in m which is mapped 
homeomorphically into G under ip and such that n maps \//(D ) homeomorphi- 
cally onto an open neighborhood N* of e in G/H. Consequently from the 
analyticity of fi and n ° xj/ there exists a neighborhood U of 0 in m such that 
for all XYe U 

H(n exp X , n exp Y) = n exp F(X , Y) 

is in N* where F: U x U -* D is analytic at 6 = (0, 0) in m x m. 

From this fi is determined locally by F which has the Taylor’s series ex¬ 
pansion 


F(X, Y ) = F(0) + F\0)(X, Y ) + iF 2 (0)(X, Y) (2) + ■■■ 
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for X , 7 e U. As in Section 1.6 we use p(e, e) = e to see F(9 ) = 0 and we also 
have 


F x (9)(X, Y)=PX+ QY 
where PX = 0), 0 Y = F 1 (9)(0 , 7) and 

F 2 (0)(J7, 7) f2) = F 2 ( 6 >)(A; 0) (2) + F 2 (9)( 0, 7) (2) + 2 F 2 (9)[(X, 0), (0, 7)]. 

As before we define for X, Y e m, 

a(T, 7) = F 2 (9)[(X, 0), (0, 7)] 

and see this is a bilinear function a : m x m —> m. Therefore the multiplication 
ft on G/H determines a nonassociative algebra ( m , a). 

Note that the converse is true locally. Thus given a nonassociative algebra 
(w, a) we let F(X\ 7) = A" + 7 + a(A r , 7). Then there is a neighborhood £/ of 
0 in m so that for X, Y e U, p(n exp X , 7 r exp 7) = 71 exp F(X, 7) defines an 
analytic local multiplication p on some neighborhood N* of e. Note that 
2 F 2 (9)[(X, 0), (0, 7)] = F 2 (6)(X, 7) (2 > = 2 <x(X, 7) in this case. 

To obtain a connection from the algebra (m> a) induced by the multi¬ 
plicative system (G/H, pi), we need Ad H cz Aut(m, a) which follows from 
x(H) cz Aut (G/H, p) as follows. First for x e G, u e H note 

x(u)n(x) ~ uxH = uxu~ x (uH) — n(/)(u)(x) 

where cf)(u) : G -> G : x -> uxu~ x is the inner automorphism of the group G 
determined by u. Also recall from Section 7.3 that Ad u = T(</)(u))(e) and 
</>(w)(exp X) = exp(Ad u(X)). Next we have for X, 7 near enough 0 in m 

x(u)p(n exp X, n exp 7) = p(x(u)n exp X, x(u)n exp 7) 

= f.i(n<j)(u) exp X , 7i(/>(u) exp 7) 

= p(n exp Ad uX , 7 c exp Ad w 7 ) 

= 71 exp F(Ad u(X ), Ad w(7)) 

and also 


x(u)p(n exp X, 7r exp 7) = r(w)7r exp F(X, 7) 

= n(j)(u ) exp F(Z, 7) 

= 7T exp Ad w(F(.7, 7)). 

Thus we conclude Ad u(F(X, 7)) = F(Ad u(X), Ad u(Y)) using (Ad h)(m) cz m. 
This implies Ad u{a(X, 7)) = a(Ad u(X), Adw(7)) using the definition of 
a(X, 7) - F 2 (9)[(X,_ 0), (0, 7)]. 

Conversely, if Ad H cz Aut (m, a), then for F(X , Y) = X + 7 + a(Z, 7) 
we see Ad u(F(X , 7)) = F(Ad w.7, Ad w7) and consequently t(w) is an auto¬ 
morphism of the previously defined local multiplication p. Thus we obtain a 
local multiplicative system corresponding to the algebra (m, a) with 
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Ad H c Aut (m, a). We summarize the results as follows, and for more details 
see the work of Sagle and Schumi [to appear]. 

Theorem A.20 Let (GjH, g) be a multiplicative system and let g = m + h 
be the fixed Lie algebra decomposition. Let g be given locally by 
g(n exp X, it exp Y) = it exp F(X, Y) where F(X, Y) = PX + QY + iF 2 (9) 
(X, y) (2) + • • • and let a(X, Y) = F 2 (9)[(X, 0), (0, Y)]. 

(a) Then a is a bilinear function which determines an algebra ( m, a) so 
that Ad H c Aut(m, a). Thus the multiplicative system (G/H, g) induces a 
G-invariant connection on G/H. 

(b) Conversely, an algebra ( tn , a) with Ad H a Aut (m, ot ) determines a 
local multiplicative system so that when the multiplication g is represented 
by g(n exp X, it exp Y) = it exp F(X, Y) we obtain ct(X, Y) = F 2 (9)[(X, 0), 
(0, Y)]. 

Remarks (1) If the multiplicative system is a Lie group G with g the 
group multiplication, then F(X, Y) is given by the Campbell-Hausdorf 
formula and a(X, Y) = i[XY], Thus the geodisics relative to the induced 
connection are of the form y(t) = exp tX for Xe g. 

(2) Conditions on the algebra (m, a) can be obtained by considering 
identities on the multiplicative system (G/H, g). Thus (G/H, g) is power- 
associative if e is an identity element and each x e G/H generates an associative 
subsystem containing e; that is, (G/H, g) is a power associative “//-space.’ 
In particular, this means the powers x", for n a positive integer, are uniquely 
defined. Thus for each jc e G/H the map n -»x" is a well-defined homomor¬ 
phism of the positive integers under addition into (G/H, g ). In this case it can 
be shown that the algebra (m, a) is anticommutative. Thus the geodesics rela¬ 
tive to the corresponding connection are of the form n exp tX for Xsrrr, 
note the work of Nomizu [1954, Section 10]. 

A specific nonassociative example arises from the Cayley division algebra 
over R by considering the set M of elements of norm 1 (where the positive 
definite norm is used). In a paper by Schumi and Walde [to appear] it is shown 
that M = S 1 = G/H where G is of type B 3 and H is of type G 2 , and the multi¬ 
plication g is the nonassociative multiplication of the Cayley algebra. Also 
(GjH, g) is a multiplicative system and the corresponding algebra (m, a) is 
given by a(X, Y) = \XY where this anticommutative multiplication satisfies 
J(X, Y, XZ) = J(X, Y, Z)X where J(X, Y, Z) = (XY)Z + (YZ)X + (ZX)Y. 
An anticommutative algebra satisfying this identity is called a “ Malcev 
algebra”; note the work by Sagle [1965]. 

(3) Since many properties of a Lie group are given by its Lie algebra 
considered as G-invariant vector fields, we can also use the functions 
l(X): G/H -> T(G/H ) analogous to that discussed in exercise (3), Section 2.7, 
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exercise (1), Section 2.8, and exercise (1), Section 5.1 for the multiplica¬ 
tive system ( G/H , p). Thus for p e G/H and X e m we have l(X)(p) = 
[( Tp)(p , e)](0, X). Note as in the exercises l(X) is a vector field if and only if e 
is a right identity element; that is, p(x, e) — x all x e G/H. Many properties of 
1{X) are given in the work of Sagle and Schumi [to appear]. In particular, 
if (G, fi) is a multiplicative system with every l(X) a G-invariant vector 
field, then the corresponding algebra ( g , a) is given by a(X, Y ) = i[XQY] 
where QY ~F 1 (6)( 0, Y). If a is not the zero function and the corresponding 
connection is holonomy irreducible, then Q is nonsingular. 


4. Riemannian Connections and Jordan Algebras 


In the previous sections we showed how to find G-invariant connections 
on G/H in terms of multiplications and nonassociative algebras. For the 
pseudo-Riemannian case these algebras can be explicitly computed in terms 
of the algebra (m, iXY) and a certain Jordan algebra as follows; see the work 
of Sagle [to appear]. Let (w, jXY, B) denote the algebra which induced 
the pseudo-Riemannian connection of the first kind on G/H according 
to Theorem A. 18 where B is used instead of the C in Theorem A. 18. Now 
suppose G/H has another pseudo-Riemannian connection given by the 
algebra ( m , a, C). Then since the forms are nondegenerate there exists a 
unique S e GL{m) such that for all X , Ye m 

C(X , Y) = B(SX, Y). 

Furthermore using the symmetry and Ad H-invariance of B and C we have 
for all ue H, 

S b = S and [Adi*, S] = 0 (*) 

where b denotes the adjoint relative to B and Ad u = Ad u\m. Using remark 
(5), Section A.2 we obtain the formula 

2<x(X, Y) = XY + 5 _1 [^(*Sy) - (SX)Y]. 

Conversely given S e GL{m) satisfying (*) we can define C and a as above to 
obtain the algebra ( m , a, C) which induces a pseudo-Riemannian G-invariant 
connection on G/H. 

Now the set 

J = {Te End(m) : T b — T and [Ad u, T] = 0 all us H} 
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forms a Jordan algebra of endomorphisms relative to the usual multiplication 
Si • S 2 = i(SiS 2 + S 2 S 1 ); see Section 14.2 concerning type F 4 . 

Remarks (1) Using the fact that S = exp T is invertible for Te J we 
obtain a correspondence between the above connections and elements of J. 
Also note for S = / we obtain the original connection determined by (m, 
\XY,B). 

(2) In case XY= 0; that is, G/H is a symmetric space, the above results 
still hold for J. 

Using Lemma 7.15 and the obvious variation of Definition 9.12, the re¬ 
sults of Sagle [to appear] give the following: 

Theorem A.21 Let (G, H) be a reductive pair with decomposition 
g = m + h such that Ad H is completely reducible in m. Then J is a semisimple 
Jordan algebra; that is, a direct sum of simple Jordan algebras. 

Corollary A.22 Let (G, H) be a reductive pair with G and H semisimple. 
Let the decomposition g — m + h be given by m = h 1 relative to the Killing 
form Kill of g and let B = Kill | m x m. Then the algebra (m, \XY , B) deter¬ 
mines a pseudo-Riemannian connection of the first kind and the Jordan 
algebra J is semisimple. 

Example (1) Let G = SO(n) and let H = SO(p ) for p < n — 1. Then 
g = so(n) the n x n skew-symmetric matrices and embed h = so(p) as in ex¬ 
ample (1), Section 6.4. Thus m = h 1 is the set of matrices of the form 



and the spaces K 0 and K 2 of matrices of the form 



respectively, are ./-invariant. Thus J = Jq® J 2 where J 0 and J 2 are iso¬ 
morphic to the simple Jordan algebra of symmetric r x r matrices where 
r = n — p. As far as we know, it is an open problem to classify the Jordan 
algebras J for the reductive pairs ( g , h ) with g and h semisimple. 
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